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This overview stresses the equilibrium/ non-equilibrium nature of the 
physical principles, as well as the basic properties of the models, used 
to calculate critical misfit and critical thickness in epitaxy. 

Hierdie oorsijpd>eklemtoon die ewewigs/nie— ewewigs karakter van die 
. fisiese beginsels, sowel as die basiese eienskappe van die modelle wat 
gebruik word om kritieke wanpas en dikte in epitaksie te bereken. 

t 

Introduction / 

The importance of the transition of an epitaxial interface between 

crystals A and B from coherency to incoherency with misfit dislocations 

(MD's) needs no motivation. The transition depends primarily on the 

misfit f at the interface. The general specification of misfit has been 
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discussed by Jesser and Braun . In the case of rectangular interfacial 
Jyrametry we may write 

f * (a - b)/c (1) 

where c ■ (a+b) when A and B are both thick and c * a or b when A is 

3-5 

thin. For crystals with different symmetries, e.g. (Ill) f.c.c./(I<0) 
'.c.c. interfaces, the misfits in two orthogonal directions may be , 
significantly different. 

The following modes of misfit acoormodation (MA) will be considered: 

(a) a misfit vernier (MV) when A-B is rigidlike^, (b) misfit strain (MS)^; 
crystals homogeneously strained to reduce disregistry, (c) misfit 
dislocations (MD's) ' (d) and a misfit strain gradient (MSG) from 
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lattice parameter a of A to b of B. MD's may be efficient (EMD), 
inefficient (IMD), of screw character (SMD) or split up into partials. 
Often two or rte modes of MA are present. 

If MD's (f) and MS ( e ) coexist , the portion f of f accommodated by MD's 
satisfies (to first order) for quadratic symmetry the relations'*’^’ ^ 

f « Pf + Qe, e = 1 e | , (2) 


where (i) P *Q*1 for a thin layer on a thick substrate (ii) P *cosX, Q=*l 

when the MD is inefficient; X heing the inclination of the Burgers vector 

to the interface and (iii) P «J, Q = 1 +r *R 1 when A and B are both thin 

with r *h a /h^, R 55 and e * je^l. R° r different symmetries MS, MD's 

and a MV may coexist. Our main aim is to report on calculations of 

critical misfit f when a monolayer (ML) is subcritical (f *0) and the 

critical thickness h c at which such a system becomes critical (f ^=0). 

We need to define the physical principles governing calculations of f 

and h c , clearly. Consider a growing epilayer which started as a coherent 

(f *0) ML. The acquisition of MD's is subjected to a hierarchy of energy 

barriers U both in creation and motion (Peierls). The acquisition is 

driven by the free energy gradients (AA) , aided by thermal energy 

(kT , T = substrate T) and proceeds towards the free energy (A) minimum 
s s 

defining the equilibrium, i.e. 

min. = A E, (3) 


where E is the energy. The observed configuration depends on AA, 

kT g , U, the deposition rate an d any waiting time'. Furthermore, the 

critical parameters will also depend on the growth mode; island growth, 
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ML-by-ML growth and Stranski-Krastanov (islands on top of ML-by-ML). 

Almost all theories model the AA and BB interaction in terms of 

3 4 5 

continuum elasticity (harmonic approximation). * * Recently ab initio 
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calculations, using appropriate potentials, have been carried out too 0 
The periodic A B field (potential V(x,y)) has been cast either in the 
Volterra model^, which invokes crystal dislocations to accommodate f, or 

“j 

the Frenkel-Kontorowa (FK) * model that expresses V(x,y T in terms of 

a (truncated) Fourier series. Whereas the Volterra model has the 

advantage of the mathematical simplicity the FK model allows for 

adjusting the AB strength as needed. Furthermore, specific epitaxial 

orientations are linked to specific Fourier terms^ as matching of A and B 
2 

wave vectors ("atomic row matching"), whereas the magnitude of the 

relevant coefficient poses as an energetic measure of the tendency to the 

realization of the specific epitaxy. 

In the FK model the self energy of a MD between two thick 

3 

epicrystals is given by 


y , cp , . 

e; = - 21 — [ 1 + 8 - (1 + 8 2 ) 1 - 0£.n{2£ O + e 2 )* - 2S 2 }] 
D 4tt 2 


(4a) 


2T V i 
U ab p ’ ^7 


1 - V 1 - V, 


- , c = j(a + b) 


(4b) 


where p is the MD spacing and the other symhols have evident meanings. 
The Volterra model introduces a "cut off radius R" such that for an 
epilayer of thickness h on a thick substrate"* the line energy of an EMD 
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becomes 


E i> 


Db 

T 


«n(f) 


I), d - — R - ( p/2 ohm h a > p/2 
’ TT * l h when h- < d/2 


when h- < p/2. 


(5) 


This expression must be modified when the MD's^re inefficient or partial 
and the Burgers vector b is essentially the c in Eqs. (4). When p becomes 

large (4) reduces to (5) but with smaller value of R. When the misfit is 
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partly accommodated by MS e the MS energy, which is of the form ' 


E 


Bh e 2 
a a 


2y a (l + v a ) 
1 -v 


h i 2 

a a 


per unit area. 


(6) 
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for interfaces with quadratic symmetry, oust be included. 

We need criteria defining f and h . The criterion for the 

c c 

- 4 5 

equilibr >'jr. MS e (or MD density f ) is defined by * 
m m 


3E 

3e 


0 


— for e * e , 
3f 



(7a) 


using eqs. (2) -(3). Furthermore, from 3E/3f = 0 we may infer that the 
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work done to introduce a MD vanishes, i.e. that * * 


W = /Fds = 0, or F = 0 when F constant; e = e^ , f = f , (7b) 


where, either F is the net force on a dislocation and ds its displacement, 

or F is an applied force employed to introduce the dislocation and ds the 

displacement of its "point" of action. The case F =0 applies when a MD 

is generated from an existing (threading) dislocation^. The conditions 

for critical misfit f £ (f *0) at given h and critical thickness h £ at 
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given f may be written as ’ 


4>(h,f c ;f =0) = 0 and 4>(h ,f;f - 0)» 0, (8) 

where <f> is one of the functions in eqs. (7). In the FK model the 

4 

transition to incoherency is sharp . This gives credibility to 

E_(h,f c ;i = f) = E D (h,f c ;f » f) (9) 

9 1 0 

as an approximate criterion for f £ . This may however overestimate * f 
and h £ by nearly 25%. - 

Equilibrium considerations assume that MD T s are freely available. In 

4 

general energy barriers of the form 

U = / bar Fds; bar = barrier (10) 

exist, where the integration is from hottom-to-top of barrier. Typical 
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barriers are the nucleation and Peierls barriers. * The barrier heights 

are reduced by MS (Peach-Kohler) driving forces F_. A special case 

e 

obtains when the opposing hD line tension F n is overcome by i.e« 

u e 

F = F - F =0, for e * f 0 0 

\i g 

and MD's generate spontaneously without the aid of thermal energy kT . 
When U/kT s ^ 1 the barriers are completely prohibitive and the system will 
remain almost indefinitely in a metastable configuration. When U < kT 

s 

the attainment of equilibrium takes time and the observed configuration 
may differ significantly from equilibrium. 


Equilibrium calculations of critical misfit and thickness 

Consider the application of the equilibrium criteria (7) and (8) to 

A • • 

layerlike growth. Frank and van der Merwe did this for a one-dimensional 
(1-D) FK model in which the AA (harmonic) and AB (sinusoidal) interaction 
strengths were respectively embedded in a force of constant y and an overall 
amplitude W„ They obtained the (ML) critical misfit f^^ and critical 
thickness h £ as (See eqs. (8)) 

f* J) - 2 /tt£ 0 , = nby (1) b/2W ; n=l for a ML, (12) 

h * n b - 8W/7r 2 y (1) bf 2 , (13) 

c c 

where of and W in (12) and (13) stresses the importance of field 

strengths; f^^ ~ 9% for an average case. 

This calculation neglects strain in the substrate, the normal strain 
gradient in the overlayer and the lateral Poisson (v) contraction. The 
latter is easily taken care of yielding for quadratic symmetry the results' 


(1) _ 2 n 2 _ y a hb2 . _ 4(l-v)W 

c 1 c b 2 f 2 ’ 

a 


( 14 ) 
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where y is now the shear modulus of A. This diminishes from 9Z 

3. C 

to about 7%. With different symmetries, as for (111) f.c.c./(110) b.c.c. 
271213 

interfaces * * ’ the misfits in two ortowonal directions are different 

and a variety of MA modes coexists. Also for crystals, other than ML’s, 
2-D coherency is extremely rare, whereas 1-D coherency follows the same 
pattern as for quadratic symmetry. 

The first calculation to allow properly for relaxation in the substrate 

and a strain gradient normal to the interface was done using the parabolic 

9 

(P) model and yielded the exact solution 


1 -V -00 {jx 

f c (h) - 4it (1+v) o xl x+a<j;(x,a) J * 


a * 


2 (l-v)h 


A/ _ -x x(s 2 +cs-x-x 2 )+a(c+s) 2 /(l-2v) 
a) * 2 2 2 2 » 

x(s -x )+a(s +cs+x-x )/(l-2v) 


s - sinhx 
^c = coshx 


(15) 


for f c (h) or equivalently f(h £ ) in crystals with equal elastic constants. 
With different elastic constants f c (h) has the same form but <p is vastly 
more complicated. ^ Matthews^ obtained using the Volterra model (eqs. (5) 
and (6)) 

f c • * 11 • h c ' 1ST ♦ 11 • < l6 > 


Equation (16) is inaccurate at small thicknesses where the "cut off" 
approximation is poor and the precise AB field strength, which (16) 
ignores, becomes important. It may be modified though to account for 
inefficient and partial MD’s. 

Most systems exhibit island growth (Volmer-Weber or Stranski- 
Krastanov),! * This case was first addressed by Cabrera^ assuming a hemi- 
spherical epitaxial island A of radius R on a thick substrate B and 
equilibrium MS’s e^ and e™ . For the critical misfit e^ + e^ * f £ , 
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while also p * 2R+. When y * U, * y and a - a -a it follows, (to 
r a D x y 

first order) using the strain energy (log. term) in Eq. (4a) that 
f c (R) " 2B 2 ] , B(R) - 2 (3 tR * ° 7) 


For given misfit f, Eq. (17) defines a critical size R c . 

9 1 0 

Recently f c (h) has been calculated for a superlattice * using the 
exactly solvable parabolic model in conjunction with the assumption that 
the stack of layers remains plane: 


f c (h;r,R) 


(l-v b )(l+r _1 R" 1 )y 

47r(l+v)y, 

b 


f 00 dx 

o xi x+a(n)4>(x)J ’ 


(18) 


where a (n) = 2(l-v)h/a and <}>(x) is a complicated function of elastic 

constants and thicknesses, and r and R are defined in eqs. (2). By 

criterion (8) this also defines a critical thickness h * h c> When the 

layer thicknesses and elastic constants are respectively equal eq. (18) 

simplifies greatly. h c /a varies approximately as f ; K * 1 . 22 compared 

to K = 2 in (14) and K =1 in (16). When h/a is small E* U ^ ~ E^/2 where E^ 

relates to a single layer on a thick substrate. For thick layers, 

Ef u ^ ~ E . Furthermore f su P ®= 4 f and h su ^ ** 4h . 

D D c c c c 

Matthews 5 has calculated f £ and h £ for a thin epilayer on a thick 
substrate using the (Volterra) force criterion in (7h) and included the 
line tension O o of a step formed concurrently at the free surface and the 
force Y 0 needed for stacking fault formation: 

Db ( 1 -v cos 2 a)[Hn(h /b) + l]+ a b sina 

h = £ 09) 

c Bbf cos A-y 0 /cos * 


A being the angle between the slip direction and the direction in the 
interface normal to the line of intersection of the slip plane (S) and 
the interface, a is the angle between the MD and its Burgers vector b 
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(applicable to partials) and 4> is the angle between the free surface and 
the normal to S. 

MD’s may also be generated by climb. Matthews'* calculated for this 
case invoking partial MD’s and obtained, using the criterion (7b): 


h 

c 


D( £n(h/b) +1] ±ct 

0 

Bf /b cos <p 


( 20 ) 


where the ± signs correspond to surfaces being created or annihilated by 
climb and is the surface free energy. This mechanism is dependent on 
the availability of point defects. 


Calculations of critical parameters (non-equilibrium) 

We briefly consider quantities relevant to criticality under non- 
equilibrium conditions: the activation barriers U (eq. (12)) to the 

n 

formation of MD's and their reduction by MS, the nucleation energy of MD 

loops, surface (image) and Peierls barriers, and the dynamics of MS relief. 

4 

Frank and van der Merwe have first shown that in the FK model the barrier 
to the injection of a MD at the free end of a coherent monolayer* (ML) is 


u (,) = (4W Z /tt)[( l-f 2 £ 2 )* - (w/2 - arcsin fZ )fZ ] 
n o o oo 


( 21 ) 


per atom row, (compare eqs. ( 10 ) . has a maximum value of 4 w£ o /tt ^ 9W 

at f=0, diminishes to about 2W at the coherency instability 2/tt£, 0 in 

eq. (12) and vanishes at f^^ for a ML or h for a multilayer (analogy of 

s s 

eq. (13)): 


f (1) = \/Z n = TTf (1) /2; h 

S o C * s 


ir 2 h /4 * 2h . 
c c 


(23) 


whereafter MD’s enter freely without thermal aid. In metals 

W ~ 0.2 - 0.5 eV. The misfit f ^ is about 14% which is reduced to about 

s 

11% if we use the more appropriate ML relations in eqs. (14). 
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Continuum theory predicts* ^ that the nuoleation energy of a MD 

loop in the absence of MS is about 200 eV which completely rules out 

the creation of such loops by thermal energy clcne. The reduction of 
— 1 A 

U n was also been considered by Dodson in the case of Si. He employed 

an atomistic approach, using modified Tersoff potentials and obtained an 

f 88 11%. This is significantly larger than the continuum prediction of 
s 

4% and equals exactly the 11% in (23), which is somewhat fortuitous. 

More recently Tsao et. al.*^ have considered the overcoming of 

Peierls barriers and measured the MS relief in metastable Si Ge 

x j-x 

strained layers grown on (001) Ge substrates. They concluded that 
strained layer breakdown is most directly correlated with (1) an excess 
(driving) stress 2(F_ - F n )/hb given by 


2ey(l + v) y(l -v cos 2 a) Jln(4h/b) 

a exc = 1 - v 2*0 - V) h7b ; 6 " f 


(24) 


and (2) temperature, where a - 60° is the angle between the MD line and 

its Burgers vector. Whereas o gxc = 0 is still valid for equilibrium, in 

the relevant metastable configuration, the energy barriers delay the 

transition to equilibrium. At a temperature of 494° C MS relief becomes 

observable when c /y 88 0.024. Since in the derivation of o the MD's 
exc exc 

are generated from threading dislocations the relevant barriers are 
presumably the Peierls' barriers. The excess stress defines an excess 
force F exc * j ^ a exc acting on the dislocation threading the overlayer)^ 
and when displaced by x reduces the barrier U(x) to AU(x) ■ U(x) - xF , 
or to 6U(x) % A(x)L/h for a critical segment length L jumping the barrier. 
When |6u| ^ is small enough compared to the thermal energy kT g the jump 

frequency increases to the extent that MS relief becomes observable. 
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An interesting case of critical thickness concerns epilayers on 

substrates of finite width. It makes use of a significant misfit strain 

8 

gradient (MSG) to accommodate the misfit f. Luryi and Suhir adopted a 

rigid substrate of width 21 along the 1-D misfit direction, a coherent 

epilayer of the same width and an exponentially decaying strain field 

with distance normal to the interface. They used a quasi-nucleation 

criterion and showed that when J l is small in comparison to the thickness 

h, there is an effective thickness h smaller than h that characterizes 

e 

the limit of the strain field, i.e. that h ~ l when h > i and h h 

e e 

when l ^ h. These considerations are qualitatively very significant 

because they point towards a new mechanism of avoiding MD's. 

] 8 

Recently Dodson and Tsao investigated dislocation dynamics based on 
a phenomenological model and assuming that (i) initially f *0, (ii) the 
biaxial MS is relaxed by a MD network, (iii) the rate process mechanism 
is the equisition of MD's, (iv) the motion of dislocations is driven by 
a localized stress and opposed by energy barriers and (v) T £ 0c can 

be expressed in terms of the residual mismatch f -f(t) - e(h), f(t) being 
the (insufficient) density of MD's and e(h) the equilibrium MS at 
thickness h. Starting with a nearly dislocation free coherent layer and 

introducing a background dislocation density f , the authors obtained the 
governing equation 

df(t)/dt * Cy 2 [ f „ - f(t) - e(h)] 2 (f (t) + f Q ), (25) 

where C is a constant depending on temperature. This equation is claimed 
to give a good description of the data for SiGe alloys on (100) Si 
substrates using as model parameters C =30.1 and f * 1 0 - ** . These 

Q 

considerations constitute significant progress towards a successful 
dynamical description of strain relief in non-equilibrium cases. 
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Concluding remarks 

In the past, calculations of critical misfit and thickness had mainly 
been cat, led out using equilibrium principles because of their simplicity. 
Recent observations have displayed a significant discrepancy between 
theoretical and experimental values, particularly in semiconductors. This 
phenomenon which has been mainly ascribed to non— equilibrium effects, has 
become topical recently. This paper attempts to put the models and 
physical principles of the predictions into perspective. Clearly the 
space allowed is completely inadequate to do justice to this endeavour, 

A more complete overview is in preparation. 
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Descriptions of Low Energy Misfit Dislocation Structures using the Parabolic 
Interaction Potential 

Jan H. van der Merwe 

Department of Physics, University of Pretoria, 

Pretoria 0002, (South Africa) 
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ABSTRACT 

The need for and development of the parabolic model as a power series 
approximation of the sinusoidal Frenkel— KOntorowa model, used to study low- 
energy misfit dislocation (MD) structures in epicrystals, is briefly discussed: Its 
application to monolayers on thick substrates, thickening epilayers, superlattices 
and structural ledges is outlined with special emphasis on critical misfit and 
thickness in coherent to incoherent transitions. The "nucleation" of a MD by 
climb from the free surface is calculated. The critical misfits obtained from (a) 
the continuum (approximation) and exact solutions, using the parabolic model 
and (b) the continuum solutions of the sinusoidal and parabolic models, are 
compared and the differences of the various approaches assessed. 
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1. INTRODUCTION 

Intercrystalline boundaries play an important role in the physical behavior 
of crystalline systems, e.g. in plasticity and the electrical and optical properties 
of epitaxial films. In the present considerations we are primarily interested in 
the near interface atomic displacements resulting from the competing intralayer 
and interlayer atomic interactions in an epitaxial bicrystal comprizing two 
different crystals [1,2,3]. When the interlayer forces are moderately strong the 
interfacial atoms are in fair registry everywhere except near line defects 
constituting a sequence of misfit dislocations (MDs) which are said to 
accommodate the misfit. One or both crystals may also be homogeneously 
(misfit) strained to eliminate the disregistry completely and accordingly 
accommodate the misfit by socalled misfit strain (MS). Because of the 
mathematical complexities involved in a proper quantum mechanical description 
of the competing forces, either ab initio models [4,5], using appropriate 
interatomic potentials, or phenomenological descriptions [1,2,3], have been 
employed in predicting interface and near interface atomic configurations. 

Of interest here are the phenomenological models. Except for a few 
anelastic [5] refinements investigators almost invariably assumed that the 
relative intralayer atomic displacements are small enough for the harmonic 
(elastic) approximation to be valid. For simplicity mostly isotropic elasticity is 
usually adopted [1,2, 3, 7]. 

In modelling the interlayer atomic interaction provision is made for the 
discrete atomic nature of the crystals by expressing the relevant forces as periodic 
functions of the relative displacement (parallel to the interface) of atoms 
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opposing each other across the interface. There are two somewhat different 
approaches: (a) the Volterra model [2] which assumes the atomic relative 
displacement to be zero on one side of the dislocation and to be one atomic 
spacing b on the other side and (b) the Frenkel Kontorowa (FK) model [8] (and 
its generalizations and approximations) that expresses the periodic interaction 
potential V in terms of a truncated Fourier series V(x). The truncation is 
justified by the fact that the Fourier coefficients decay rapidly with harmonic 
order [9]. 

In the original one-dimensional (ID) model Frenkel and Kontorowa 
expressed the energy of their system as [8] 

E = £ ^A(x n+ i -x„ -a) 2 + £ V(x„) (1) 

n n 


with 


V(x n ) = fV 0 [l-cos(2xx n /b)], (2a) 

x„ = b(n+£„) = nb + U„. (2b) 

Equations (1) and (2) model one half crystal, simulating a monolayer (ML), in 
terms of a linear chain of particles connected by elastic springs (harmonic forces) 
of force constant A and natural length a and the other (a thick, effectively "rigid" 
substrate of atomic spacing b) to be the source of the periodic potential V (1). Its 
overall amplitude V 0 is seen as a measuere of the interlayer interaction. The 
integer n enumerates particles and potential troughs starting from a position of 
(nearest) registry so that b£„ = U„ represents the displacement of particle n from 
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trough n. 

The parabolic reproduction of the FK model replaces the sinusoidal 
relation (1) by a sequence of parabolic arcs, for example [10,11] 

V = tf2V 0 ? = x2V 0 U2/b2, b|£| = | U I < b/2, (3) 

which are obtained by truncating the Taylor series of (2) at quadratic terms. 
Hence the arcs in (2) osculate with the troughs of (1). If we assume that relation 
(2) is a true representation of the interlayer interaction then (3) is an 
approximation which is still acceptable [10,12] at U = ±b/4 and poor towards the 
boundaries U = ±b/2, as may also be inferred from Fig. 1. In semiconductors in 
which the crest of the potential V is believed to be fairly sharp as compared to 
the troughs the approximation (2) is rather inadequate and it has been suggested 
that the parabolic form (3) may be nearer to the truth [13]. 

The simple representation (2) has subsequently been extended to 
two-dimensional (2D) potentials V(x,y) appropriate to 2D interfaces 
characteristic of monolayers (MLs) [1,14], for example, and to linear interfaces 
between thick epicrystal halves [7], In the latter case it had been convenient to 
represent the interaction in terms of an interfacial shear stress p zx (x) (shear force 
per unit area) which, in the parabolic approximation [15], takes the form 

Pzx(x) = j<U(x)/d, |U| < b/2 (4) 

where d is the separation of the atomic planes on either side of the interface, x 
and z are respectively parallel and normal to the interface and /z an interfacial 
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shear modulus of magnitude characteristic of the interlayer interaction strength. 
One may use either Eq. (2) or (3) to derive the equivalent of (4) and obtain by 
comparison the relation 

/i/d = 2?r 2 V 0 /b 2 s, (5) 

where s is the interfacial surface area per atom. In any interface with quadratic 
symmetry s = b 2 . 

When the crystal halves are different in thickness and/or elastic constants 
the relative normal displacement W of the opposing atoms at the interface varies 
along the interface. This induces a normal interfacial force which has been 
modelled [13,15,16], using Hookian relations, as 

Pzz(x) = [2/i/( 1— 2i/)] W (x)/d , (6) 

where v is a Poisson's ratio for the interface. In terms of an interaction 
potential , (6) is also parabolic. Whereas in case (4) a nonlinear relation exists at 
| U | = ±b/2 when a MD is present, W in Eq. (6) will always be small enough for 
the linear Hookian relation to apply. 

Having described the background of the parabolic model it is now our 
objective to briefly review the use of the parabolic model in describing low energy 
misfit dislocation structures (LEMDSs). Of specific interest is the transition 
from a coherent (C) interface, where the misfit f is fully accommodated by MS e 
= f, to an incoherent (IC) interface with MD density f. When MDs and MS 
coexist [2,13] 


6 


f = (a-o)/b * f + | e | . (7) 

The C-IC transition is mathematically characterized by a change from f = 0 to 

f*0. 

The behavior of our system is governed by the law that all natural 
processes carry thermodynamic systems towards equilibrium. The equilibrium 
configuration is determined by minimization of the free energy which, in the case 
of MDs; may be approximated by minimization of mechanical energy E (1-3]. 
The solutions thus obtained for the governing equations within the crystals, 
which are normally in the form of partial differential equations, must satisfy 
certain boundary conditions, for example, that the "upper" surface of an epitaxial 
film on a thick substrate is a free surface. This implies that the forces that act 
on the surface, and [1,3] accordingly the surface values of the corresponding 
calculated stresses, vanish. 

Apart from the suggestion that the parabolic model could be a better 
representation of the interlayer interaction potential in semiconductors [13]. for 
example, than a low order Fourier truncation, it enters piecewise linearly in the 
governing equations making them exactly solvable. Although the need for exact 
solutions was most pressing in the case of thickening epilayers, the model has 
been useful and even indenspensible in other cases. 


2. APPLICATIONS 
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2.1. Monolayers : Ore— Dimensional Model 

Originally the ID FK model was introduced to simulate the plastic 
behavior of crystals [8]. It was subsequently applied to describe the 
accommodation of misfit in epicrystals, in which misfit exists in one interfacial 
direction only [1,7,17]. In certain interfaces the misfit may be so large in one 
direction e.g. {Ill} fcc/{l 10} bcc interfaces [18], that the relevant misfit 
accommodation approaches a misfit vernier (MY) with little effect on the atomic 
arrangements in either crystal. The small misfit direction accordingly poses a ID 
MD problem. Minimization of (1), using (2b) and (3), yields the piecewise linear 
governing equations [10,11] 

£n+i — 2£ n + £n-i = J °{£n]; | [£n] | £ 2 • (8) 

where [£„] means that £„ must be replaced by £ n • 1 if atom n enters the 
parabolic domain of neighboring troughs n ± 1. The disadvantage of the 
parabolic approximation, as may be seen from (8), is that the right hand side has 
discontinuities at £ n = which is a mathematical complication. 

Possibly the parabolic model was used for the first time [19] to support 
results obtained by Frank and van der Merwe [1] using the ID FK potential in 
Eq. (2). In that case the governing equations were a series of second order 
nonlinear difference equations which simplify in the continuum approximation to 
solvable Sine Gordon equations. Frank and van der Merwe obtained critical 
misfits [1] 
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f c = 2/ xt 0 , f s = l/'o> f9a) 

(l = Ab 2 /2V 0 . (9b) 

Below a misfit f = f c the equilibrium configuration of the chain is the one in 
which it is misfit strained to coherency (a-*b), but still needs an activation energy 
for the introduction of MDs. Above f = f 8 the energy barrier vanishes and MDs 
enter spontaneously. The question as to the extent to which the approximation 
effects the predictions was investigated using a modified parabolic potential [19]. 
The modification was an improvement on the form (3) in that the lower part of 
the potential was represented by the trough segment of (3) and towards the 
crests at x = ±b/2 by segments of inverted parabolas. This eliminated the 
discontinuities. The modification of the governing equation (8) contains linear 
pieces of positive and negative slope on the right hand side and connect smoothly. 
The investigation using the modified parabolic potential showed that the effect 
on the predicted critical values was not serious. 

Of interest is that, instead of the critical misfits f c and f s of Eqs. (9). the 
authors obtained for the parabolic model the results 

fc " gn+ia/4^) 1 ’ fc " F i 77’ 

U = f{(l + 47 ^' + 27^7 o , fs = f\ Q • 

/ 

when expressed in terms of ( 0 . The values f and f correspond to exact and 

/ » 

continuum (approximate) solutions, respectively. It is seen that f s > f s = 

/ * /i / 

whereas f c < f c - Both f c and f c are greater than f c = 2/ 7 r/ 0 ; f c only when l 0 > 2. 


(10a) 

(10b) 
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Frank and van der Merwe [1] showed tb? f / 3 « 7 in average cases in which intra- 
and interlayer forces are the same. It may thus be surmised that the regime / 0 > 

2 covers most common cases. Also if we take conveniently 1 0 = 2x it follows that 

/ * / • 

f c differs from f c by only about 3% whereas the difference between f s and f s is 

about 25%. 


2.2. Adsorbed Monolayers : 2D Models 

According to the Frank-van der Merwe (FVDM) theory an adsorbed (2D) 
ML grows coherently (incoherently) on a crystalline substrate under 
quasi-equilibrium conditions when the misfit falls below (exceeds) a critical 
value which is somewhat below the value f c in Eq. (9a) because of Poisson 
phenomenon [1,14]. The coherent to incoherent (C-IC) transition is 
accomplished by the introduction of MDs which may be of finite length and 
terminate within the ML (see Fig. 2) [20]. The MD contains one more (less) ML 
atomic row as compared to substrate potential troughs when a < b (a > b). 
When the MD involves an extra atomic row the MD grow (shrink) by the capture 
(ejection) of atoms from the surface (row). Shrinkage by the absorption of 
vacancies from the ML may also be invisaged. Phase transitions in adsorbed 
MLs had become a subject of intensive theoretical and experimental study. Not 
only had these studies generalized the theory to account for temperature effects 
but they had also introduced new nomenclature: solitons, discommensurations, 
etc. for MDs and (Taylor) dislocations [20,21] for the atomic configuration within 
the ML where the MD terminates. This configuration differs from that of a 
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conventional dislocation in that, the atomic positions within the ML not only 
depends on the atomic interaction within the ML but also on the modulating 
potential V(x,y) emanating from the substrate. Although this situation can be 
modelled in terms of Fourier truncations the resulting governing equations could 
not be solved. However when parabolic models are used instead for both the 
soliton and the dislocation they become solvable [20] for the atomic 
displacements in terms of Fourier transforms. Expectedly the analysis is 
moderately complicated in this case. Different sets of difference equations, with a 
variety of boundary conditions apply to different areas depicted in Fig. 2. Simple 
solutions had only been obtained in the continuum approximation in which 
difference equations reduce to differential equations. Of particular interest is the 
result that the interaction between the two (Taylor) dislocations terminating the 
soliton (MD) falls off exponentially with separation, whereas it had previously 
been assumed in theories of two-dimensional melting transitions in adlayers that 
dislocation dipole unbinding follows a logarithmic dependence [21]. 


2.3. Thickening Epilayers 

One of the important problems that "evaded" an exact solution is the one 
of LEMDSs at the interface between a thickening epilayer and its substrate. By 
"exact" we mean a solution that satisfies the governing equation even though the 
equation itself already involves a degree of approximation, as for example the 
continuum approximation which reduces a set of difference equations to a single 
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partial differential equation in the ID FK model fl]. Exact solution? using 
sinusoidal FK models exist for the extreme cases, namely a ML on a thick 
(essentially rigid) substrate [1,14] and an epitaxial bicrystal consisting of two 
"infinitely" thick crystals [7]. When both crystals are thicker than half the MD 
spacing the "infinite" solution is accurate to within 2% [15]. An extrapolation of 
the ML solution to multilayers, already involves a nonneglegible error at two and 
three fold layers. Attempts at interpolations between the ML case and the thick 
layer case had not been very reliable; the errors have not been calculated. 

Matthews has adapted the Volterra model to study LEMDSs in growing 
epilayers [2]. For a growing epilayer A on a thick substrate B the energy per unit 
length of a MD is written as 

Ep = Db(^n^+ 1) ; D = 

R = f j p when h^ > ^p 
h^ when h^ < ip 

R is a "cut off" radius accounting for cancelling of overlapping of strain fields, p 
and h are respectively the MD spacing and layer thickness and /i and v 
respectively the shear modulus and Poisson's ratio. The constant term Db makes 
provision for the core energy of the dislocation. By a proper choice of R 
Matthews could handle epilayers of all thicknesses. However, the Volterra model 
does not provide independently for the interfacial bond strength between A and B 
[3]. This is evidently important in epilayers of small multiplicity [13] as one may 
deduce from the fact that V 0 appears in the relation (9) for f c and that V 0 is not 
necessarily determined by and /ig. In thick epilayers the interfacial bond 
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strength vanishes asymptotically from the results [3]. 

Exact solutions of the problem of a growing epilayer on a thick substrate 
of a different crystal had previously been studied by various authors and more 
recently in greater detail by van der Merwe and Jesser [13,16], using the 
parabolic model in the form of Eq. (4). Since the crystals have different 
thicknesses, provision is also made for a normal interface stress in the form of Eq. 
(6). With the conventional assumption that the MDs are long and straight the 
problem is one of plane strain that is most efficiently analysed in terms of an 
Airy stress function satisfying a biharmonic equation that can be solved in 
terms of Fourier transforms [7,13]. The stresses are obtained from as simple 
derivatives. The Fourier transforms are selected to satisfy appropriate 
periodicity and symmetry conditions for the film, a free top surface and surface 
stresses (4) and (6) at the interface. The strains accordingly follow from Hooke's 
law and the displacements U and W by integration. The periodicity, symmetry 
and boundary conditions yield a set of linear equations from which the Fourier 
coefficients have been obtained. Thus the problem is solved in principle. By 
energy minimization the equilibrium distribution of misfit between MS e m and 
MDs f m as well as the critical misfit f c , or equivalently critical thickness h c . are 
obtained. For the critical misfit of an epilayer of thickness h having the same 
elastic properties as the substrate, we have, for example [13] 

J r® i 

q x[x-f o(h)<J)(x,h)j 4 x(I+j/)^k) > 

1 x(s 1 2 +cs-x— x 2 ) + £h( c+s) 2 

<)>(x,h) = 2 x(s 2 — x 2 ) + wi( s 2 +sc+x-x 2 ) ’ 
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( 12 ) 

Supposing that a MD enters a coherent epilayer by climb from the free 
surface, the question is: how does the energy of the epilayer system depends on 
the distance t of the MD from the free surface and on the layer thickness h? 
Whereas the energy per unit length of the MD increases as the MD recedes from 
the free surface energy of coherency (misfit) strain AE- (per unit length of MD) 
will be released. The energy of formation of a MD by climb, i.e. the change in 
energy of the system per unit length of MD AE will thus be given by (see 
Appendix) 

AE = — AE + AE n 
e D 



£ l—v „ 2( 1— i/)h 

s= (i-^)d - a= <r 


s = sinhx , c = coshx 


where 



l when l < h 
h when ( > h 


and I(£) = 1(h) with h = l. 
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The analytical evaluation of Eq. (13) is difficult because of the complexity 
of the function (j)(x), however numerically it was possible to show that the 
equation is very closely approximated, for the case of Poisson's ratio equal to 
1/3, as 


AE/(^c/4t2) = 11.4 + 4 tn(l/c) - 16ar®(//c)f. (14a) 

The maximum energy AE*, which in this analysis is a "nucleation energy" 
barrier to be overcome by the MD as it climbs to the interface, is given by the 
expression 

AE70iac/47r2) = 7.4-4 fn(4^f) (14b) 

and occurs at the thickness l* where 


(* = c/4x2f. (14c) 

This critical thickness for nucleation is related to the critical thickness for loss of 
coherency, h c by 

h c = /*[2-85 + /n(hc/c)J. (14d) 

From the relation (14d) one sees that the critical thickness h c is greater than 
three times the critical thickness for nucleation l*. When the film thickness is 
less than the critical thickness for nucleation of an MD then the energy per unit 
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dislocation length always rises as it approaches the interface and no critical 
thickness for nucleation exists. One could state this result another way. If a MD 
exists at the interface when the film thickness is less than the critical thickness (* 
then the MD is attracted to the surface and should be eliminated from the film. 

The dependences of AE, -AE e and AE D on t are illustrated in Fig. 3. 
Whereas AE D increases monotonically (essentially logarithmically), -AE e 
decreases linearly until ( = h; thereafter it remains constant. AE rises at first to 
a maximum value at thickness (*. Thereafter it decreases until t = h at which 
point the slope changes discontinuously to a positive value characteristic of AE D 
at l = h, and AE e assumes a constant value AEf(h). The value of thickness for 
which AE vanishes is l = h c (f), the equilibrium thickness at misfit f. 


2.4. Superlattices 

The problem of predicting LEMDSs in epitaxial superlattices has 
previously also been dealt with by Matthews and Blakeslee using the Volterra 
model as outlined in Eqs. (10) [22]. Naturally this involves the same 
shortcomings as mentioned in relation to the growing epilayer on a thick 
substrate. The parabolic model had also been used beneficially in this case 
[13,16]. The analysis in this case differs from the "growing layer on a thick 
substrate" in two important aspects: (i) both layers now participate in MS and 
(ii) the boundary conditions are different. As to MS we have the relations 

f « f + e(l+r' l R _1 ) ; e = |eg| , 
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' ~ h A^ ll B ’ R - 




(15) 


where the superlattice consists of alternating layers of crystals A and B of 
thicknesses 2h^ and 2hg and the other quantities in (15) have evident meanings. 
The relations in (14) assume that no bending occurs. When the crystals have 
(more or less) equal elastic constants, as is often the case, also their thicknesses 
are the same and f = f + 2e (e= | e * | = | eg | ) [13]. 

As to boundary conditions there are no free surfaces except the outer 
boundaries. The problem of LEMDSs for the interior of the superlattice has been 
analysed using the parabolic model. Apart from the relations (4) and (6) that 
hold for the interlayer interaction field periodicity with a wavelength equal to the 
MD spacing p exists. Additionaly the unit lying between the two midplanes of 
neighboring layers A and B are constrained so that the normal displacement and 
shear stresses vanish on these planes. When the elastic constants are equal the 
condition of vanishing normal displacement may be replaced more simply by zero 
normal stress on this plane. In this case of superlattices the problem of LEMDSs 
could be solved for the equilibrium MS e m and MD density f ro - For example for 
the critical thickness t} c - hg/b of the layers of type B the authors obtained 


f(j?c;r,R) 


(l-t/gMl+r^R* 1 ) 
4?r(l+t'g)/ig ” 


f dx 

J Q X[X+Q^(X)J ’ 


a = tj c , 


(16) 


where the function <j>(x) is somewhat more complicated than (j)(x) in Eq. (12). 
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Although an analy t5 ral expression for the integral could not be found it lends 
itself to fairly accurate approximations and the role of various parameters could 

be investigated. For example, certain useful scaling relations could be obtained: 

/ / 

thus for R = 1 = r it was shown that f c ( 77i,R,) = f c ( r?i,R- 2 ) where R = /i/2ji + , 

fx; 1 = H^ 1 + H g 1 and v - v g. The authors also showed that rj c ~ f* 1 ' 3 whereas tj c 
~ f* 2 for the extrapolated Frank-van der Merwe theory [ 1 ] and tj c - f' 1 
(approximately) for the Volterra approach [22]. 


2.5. Structural Ledge Interfaces 

An interesting LEMD related mechanism of misfit accommodation (MA) 
occurs in solid-solid phase transformations where the introduction of a sequence 
of atomic ledges of the same sign (down or up) effects a trade of the conventional 
sequence of efficient MDs that accommodate the disregistry parallel to a plane 
interface for: (i) interface terrace patches of zero average disregistry and (ii) 
widely spaced MDs with Burgers vector normal to the interface to accommodate 
the misfit that accumelates (stepwise at the ledges) over a number of patches 
[23,24]. Each terrace commences with a maximum disregistry on one side which 
decreases along the terrace to reach an equal but opposite value on the other side. 
In addition this disregistry is reduced by elastic relaxation due to the interlayer 
interaction. Primarily a ledge effects a relative displacement of the atomic 
paterns on either side of the interface (pattern advance), resetting the disregistry 
to the same value as at the beginning of the terrace. It has been shown using the 
parabolic model for disregistry as well as for the MDs; that this trading effects an 
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energetic gain depending on the misfit f and pattern advance S at a ledge. 
Energetically stepped interfaces become progressively more favorable than plane 
interfaces as S increases and f decreases. 


3. CONCLUDING REMARKS 

A basic aspect of the theoretical study of LEMDSs at epitaxial interfaces 
is the modelling of the periodic interfacial interaction. Frenkel and Kontorowa 
modelled this by a Fourier series (sinusoidal function), truncated at second order 
harmonics. The parabolic model is primarily a second degree Taylor series 
approximation of the Frenkel-Kontorowa model and its extensions. Although it 
is an acceptable approximation only over about one half of the period, it is 
believed to constitute a better approximation to the short ranged covalent type 
interfacial interaction in many semiconductors as compared to a low order 
Fourier truncation. Also it compares favorably to the Volterra model that does 
not provide independently for the interfacial bond strength. The main merit of 
the parabolic model is that it linearizes the equations governing the atomic 
arrangements, (resulting from the intralayer and interlayer interactions) to 
render the equations solvable. The model had been used to obtain useful exact 
solutions for the following problems: (i) in the linear chain, to assess the 
discrepancies between critical misfits predicted by (a) the continuum FK and 
parabolic models and (b) the continuum and discrete parabolic models; (ii) the 
nucleation and energetics of a MD that climbs from the free surface; (iii) the 
critical thickness (and misfit) in (a) an epilayer on a thick substrate and (b) a 
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superlattice and (iv) the energetics of structural ledges in solid-solid phase 
transformations. Clearly the parabolic model had been useful. It is foreseen that 
it will continue to be useful in analytical studies of interfaces. 
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APPENDIX: ENERGY OF FORMATION OF A MD 

Equation (29) of ref. 13, when written in terms of the MSs e x and e y for 
an epilayer on a thick substrate, becomes 
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E e = «? + ■ 

where c is the reference lattice spacing [7]. The misfit stress p xx which may be 
written as dEJd((ex), accordingly becomes 

V 

Pxx = 2ll(l+v){/(l-u) 

in the coherent e = f configuration of an interface with quadratic symmetry e x = 
e y = e. The work done per unit length of MD by p xx when the MD climbs to a 
depth l below the free surface is accordingly 


-AE g = -pxx^a = -2paJ,i(\+v) / (\-v) (Al ) 

The work done per unit MD length to form a MD at a depth t against the 
linear stress p X x emanating from the MD (p X x may be obtained from Eqs. (14) 
and (A8) of ref. 13) is given by 

1 / 

it J Pxx(0,z)adz . 

This is nothing but the energy per unit length of MD and may be obtained from 
Eqs. (30) and (A8) of ref. 13 as 





D’ 
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<d = f =1 ’ * - Td ^ 1 ~‘ /)p - !A2) 

where <J)(x) has been defined in Eqs. (13). If we write 
x = 2 -^o , dx = 2 -^dn ; dn = 1 
and let p oo we obtain the result in Eqs. (13). 
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FIGURE CAPTIONS 

Fig. 1. Comparison of the sinusoidal (Eq. (2a)) and parabolic (Eq. 3) potential 
representations; curves A and B respectively. 

Fig. 2. Monolayer (ML) configuration with a MD of finite length (between ML 
and substrate) terminating with Taylor dislocations (within the ML) at A and B. 

Fig. 3. Graph of the energy per unit length of MD (see Eqs. (13) and (14)) in 

units of /iac/4;r 2 plotted against depth in units of l/c. The curves illustrate the 

energetics of a MD that is formed at the free surface and climbs to a depth t in 

% 

an epilayer of thickness h on a thick substrate. Curves A, B, C represent 
respectively the work (AE p ) done in forming the MD in a strain free overlayer, B 
the work (AE e ) gained from the MS (Peach Koehler) stresses and C the resultant 
energy of formation AE of the MD. The nucleation parameters t* and AE* have 
evident meanings. The figure is drawn for f = 0.001, h = 260c and h c = 207c. 
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THE PHYSICAL FOUNDATIONS OF CRITICAL PARAMETER 
CALCULATIONS IN EPITAXY 


* 

Jan H van der Merwe and W A Jesser 

Physics Department, University of Pretoria, Pretoria 0002 
Republic of South Africa 


The critical misfit f c and critical thickness he at 
which an epitaxial bicrystal undergoes a coherent to 
incoherent (C- IC) transition are of great 
importance. The purpose of this paper is to 
reconcile the conditions (quasi- equilibrium/ 
non- equilibrium) under which growth occurs witn 
the physical bases of the theoretical predictions of 
the critical parameters. Because of the existence of 
energy barriers to the acquisition of the MDs, which 
are needed for the transition, it is inevitable that 
some data will reflect a time lag between growth 
and equilibration. The speeding up of the 
equilibration process by the barrier height reducing 
Peach- Koehler MS forces is highlighted. Criteria 
for predicting critical parameters are set up and the 
phenomenological (Frenkel- Kontorowa and 
Volterra) ana ab initio models used in calculating 
these parameters are explained and appraised. 
Examples, demonstrating calculations and their 
correlation with empirical data in each category, 
are briefly described. 


INTRODUCTION 

The transition of an epitaxial bicrystal, A on B, from registry (coherency) at 
the interface to disregistry (incoherency) is of great fundamental and 
technological importance. This transition, wnich will be referred to as the C- IC 
transition, has recently become topical again (1- 4). It is therefore vital that the 
principles and models used to predict the critical thickness he (or misfit / c ) at 
which the transition occurs, be clearly understood and their reliability be 
assessed, theoretically and experimentally (5,6). 

The misfit between A and B is most simply defined for interfaces with 
rectangular lattice symmetry as (4) 

A = (oi-bij/du i = x,y , 


( 1 ) 


where the value of d\ is between aj and b\ and depends primarily on the relative 
thicknesses h and h D , When the interfacial symmetries are different, as for 

{111} fee/ {110} bee interfaces (8) specification of / needs more care. Recently 
Braun (9) has introduced a powerful description of the interface geometry 
(including misfit) in terms of reciprocal lattice vectors. 

Of the various modes of misfit accommodation (MA) illustrated in Fig. 1 (5,6) 

we shall limit ourselves mainly to (i) misfit (homogeneous) strain (MS = e), (ii) 
misfit dislocations (MDs of density f) which may be efficient (EMDs) or 
inefficient (IMDs) when their Burgers vector is inclined to the interface or even 
split into partials and (iii) a misfit vernier (MV) when the misfit is large and/or 
the interfacial bonding small so that the periodic strains associated with MDs 
practically vanish. An interesting atomic configuration of MA, referred to as a 
misfit- Taylor dislocation (MTD), (10) occurs in ultrathin overlayers of one ML 
or less. The C- IC transition may namely be realized by the nucleation of a finite 
segment of MD at the coherent interface and its subsequent growth by a zip- like 
mechanism effected by the consecutive inclusion (or ejection) of adatoms until 
the MD reaches a boundary edge of the overlayer. In the finite segment stage 
the configuration is equivalent to an extra (or missing) row of overlayer atoms 
terminating within the overlayer sheet. This constitutes a "Taylor" dislocation 
within the overlayer sheet and hence the name MTD for this combination of 
dislocations. MTDs may be operative at the C- IC transition in MLs. Two more 
forms of MA, namely, " ruptures " and distortions " will conveniently be 
introduced at a later stage. Misfit curvature (MC) and misfit strain gradient 
(MSG) modes of MA (5,6) also exist but will be ignored below. 

When the misfit in a given direction is jointly accommodated by MDs and MS 
we have approximately 


/ = /+e, c = | c A | , A thin (11) 


f =J+ Pe, e = | e B | , P = 1 + 


W'-V 

w'-v 


( 2 ») 

(2b) 


where Eq. (2b) applies to layers with arbitrary thicknesses 2 h k and 2 h B , for a 

system that is constrained not to bend (12). If the temperature at which 
measurements are made differs from the growth temperature the thermal strains 
may be different in A and B and must be taken into account. In technology 
thermal strains are often matched (approximately) in accordance with Vegards 
Law by selecting suitable alloy compositions (13). 


BASIC CONSIDERATIONS 

The growth of epitaxial bicrystals involves both the dynamics of adatom 
surface migration and of the acquisition of MDs. Both processes require the 
overcoming of characteristic energy barriers of which the migration activation 
barriers Q are typically between 0.5 and 1.0 eV in metal surfaces with closest 


packing (14). The energy barriers U ac opposing the acquisition of MDs, form a 
hierarchy with a maximum near 200 eV for the nucleation of a MD loop in a 
perfect 3D crystal, down to a fraction of an electron volt per atom length of MD 
lor overcoming the Peierls barrier to glide (11). Whereas the overcoming of the 
former by thermal energy is completely prohibitive, the latter may be reduced by 
Peach- Koehler (MS) forces (5,6) to within the range (< 30 kT) of thermal 
activation for a finite length of MD, and may even be reduced to zero if the MS is 
large enough. The driving forces (5,6,11) for the abovementioned processes - 
adatom migration and MD acquisition - are the free energy gradients AA. These 
forces vanish when the equilibrium configuration is realized at minimum (free) 
energy A : 

A = min. a E = mechanical energy. (3) 

The approximation A a E, applies to MDs which form regular arrays in an 
otherwise perfect crystal. It is exactly valid only at absolute zero. The majority 
of past calculations have been based on (11) the equilibrium principle in Eq. (3) . 
Non- equilibrium calculations are of mostly of recent origin and represent 
attempts at (i) calculating activation energies (15), (ii) at establishing the 
reduction of such barriers by Peach- Koehler forces (16) and (iii) at describing 
the dynamics of MS relief by the introduction of MDs (17). 

An aspect which is of great significance in the considerations below is the 
growth mode (18). For the clear exposition of the basics of our considerations we 
may retrict ourselves to the growth of overlayers of uniform thickness as in 
ML- by- ML growth. In quasi- equilibrium growth this requires the 
adatom- substrate (A- S) bonding to be strong in comparison with 
adatom- adatom (A- A) bonding. Relatively strong A- A bonding induces island 
growth at quasi- equilibrium, but may be coerced (19,20) into layerlike growth by 
an appropriately high supersaturation (non- equilibrium) so that single atoms are 
super critical nuclei and have sufficient time and mobility to move into a local 
(crystalline) stable position. This is a technologically important case. 


MODELS 

The minimization procedures, involved in the application of the governing 
condition (3), can be carried out either through ab initio or phenomenological 
model calculations (5,6). In the former the minimization proceeds directly by 
Monte Carlo or other methods using appropriate interatomic potentials (15). In 
the phenomenological models (11,21,22) the most relevant features of the 
epitaxial system are at first modelled phenomenologically for use in deriving 
energy expressions that can be minimized with respect to the relevant variants. 
Whereas the ab initio models are quantatively more accurate, depending on the 
reliability of the potentials employed, the phenomenological models have greater 
generality and wider predictive powers. 

The phenomenological models can be broadly classified according to the 
representation of the interfacial (A- B) interaction in: (a) the Volterra model (11) 
with an interfacial registry constraint (except at the dislocation line singularity), 
and (b) the Frenkel- Kontorowa (FK) model (20- 23) in which the A- B 
interaction is represented by a periodic potential V of wave length d lying 



between a and ft as in Eq. (1). In the original ID FK model (21,23) consisting of 
single (ML) chain of particles on a rigid (thick) substrate (see Fig. 2) 

V(x) = 4 Vo[l-cus(2itx/b) ] , d = ft . (4) 

Vo may be linked (8) to the adatom migration barrier Q and the desorption 
energy £des (A- B bond strength) by 

Q = gV 0 = nEfa . (5) 

The factors g and n depend on the dimensionality, geometry, and bond type and 
strength. In Eq. (4) g= 1. 

The form (4) of V constitutes a ID truncated Fourier series which may be 
extended to 2D (8,9,20) in the interface between a ML and a thick substrate or to 
a "linear" interface with ID misfit between two thick crystals, as in the 
generalized Peierls- Nabarro model (22). The truncation is justified by the fact 
that the Fourier coefficients C\ decay rapidly with harmonic order (24). A 
significant property of the Fourier description is that each Fourier term defines 
an epitaxial orientation and geometry in which the magnitude of the 
corresponding coefficient is a measure of the tendency towards realization of that 
orientation (8,9). 

The potential in Eq. (4) degenerates into the parabolic potential (25,26) 

- - a sequence of parabolic arcs - - when Taylor- developed and truncated at 
lowest (2 degree ) powers: 

V=\b-WVrf, |i| <ift. (6) 

The A- A and B- B interactions in both the Volterra and FK (11,21,23) 
models have been modelled in terms of the nearest- neighbor isotropic harmonic 
(linear elasticity) approximation. More recently this has been extended to 
anisotropic elasticity (9), nearest- neighbor anharmonicy (Fig. 3), (27) and to the 
case where the bulk structure of the overlayer has a "nearby" metastable phase 
that can be reached by a homogeneous strain (MS) (28,29). In the latter two 
cases the point(s) of inflection where the pair potential and free energy of the 
initial structure becomes unstable, has dramatic consequences. In the former 
case this leads to additional modes of MA "ruptures" in case of strong A- B 
bonding in combination with positive misfit or to " distortions " at negative 
misfit. In both cases this occurs because the (nearest- neighbor) bonds become 
streched beyond the point of inflection. While the "rupture" may be seen as a 
degenerated MD, the "distortion" represents a stretched bond that separates 
over layer segments of close registry with the substrate. The overlayer segments 
represent dimer, trimer.... groupings, depending on the magnitude of the misfit. 

It is proper at this point, to record some expressions for the energies as 
calculated for the simplest case of isotropic modelling. Since the MS energy j EL 

and the energy E' per unit length of MD in a sequence of regularly spaced (p) 

MDs are additive, we may specify them separately: 



(7a) 

(7b) 


£( a ) = B h e\ per unit area for .4, 
as introduced by Matthews (11) and by Jesser and Kuhlmann- Wilsdorf (30), 


E i> = ()‘ KB cp/irV[l+ff-(l+/P) i -Ptn{20(l+lP^ -2IP}] , 

g 2r\nd 1 1- \ 1-1, » 

w r„ « A n B 

in the FK model (22,25) (for > ip) and 


E i 


R = 


Db{in(R/b) + 1}, D = X 0 b/2x, 
i p when h k > ip 
h k when h k < ip 


d = b 


(8a) 

(8b) 


(9a) 

(9b) 


in the Volterra model (11). In the foregoing p and v are respectively the shear 
modulus and Poisson's ratio, the subscripts A and B have evident meanings, and 
R is referred to as the "cut off " radius. Note that the interfacial shear modulus 
/* AB , measuring the strength of interfacial interaction, is absent in the Volterra 

model. It is of interest that the expression (8a) reduces to the form (da) in the 
limit of large p, with R 2 p/10 rather than ip. Furthermore, the relations (8) 

and (9) (relating to zero MS e) may be adapted for nonzero MS simply by 

replacing d and p by d and p with / = d/p, where the bar designates quantities in 
a configuration with MS. For purposes of energy minimization we need 
expressions for the total energy per unit area of interface: 


E ^ + E- e = E(h k ,h Q ,fJ ore ) 

(10) 

' D = fPy^ <x) + hE^ y) )/hPy per unit area 

(Ha) 

= 2 E' Jp for quadratic symmetry (px=Py=P)- 

(Hb) 


Equations (11) relate to cross grids of MDs in which the crossing energy of MDs 
may be neglected (22). E ^ x) is the energy per unit length of a MD in a sequence 

of MDs spaced at distances p x along the x- axis. 



CRITERIA 


Because of its greater analytical simplicity we adopt a criterion for locating 
ciii icality (5,6,11), the limiting operation 

/ -* 0 (equivalently e -» /) (1-2) 

defining the inverse (IC- C) of the C- IC transition. Using relations (3) and (10) 
we accordingly obtain the minimization equations 

0 = |f = ^for ( ej ) = (e a ,J a ) . (13a) 

The relation dE/d] = 0 in (13a) is equivalent to requiring that the work W 
needed to introduce an additional MD vanishes: 


W = f Fds = 0 at ( ej) =(e m ,J m ). (13b) 

In the relation (13b) F is either an external force on the "free" surface that is 
used to introduce the dislocation and ds an element of displacement of the surface 
(21) or F is the resulting integrated force on an existing TD (threading 
dislocation) (11) that glides to lay the MD in the interface and ds in the element 
of displacement of the TD. When Fis constant Eq. (13b) is satisfied by 

0 =F = F--F J> , (13c) 

where F- is the MS (Peach- Koehler) force on the TD and F the line tension of 

C l) 

the MD being laid in the interface. 

By combining the criticality criterion (12) with one of the equilibrium 
conditions (13) we obtain the equations (5,6) 

* (h,f c ;7 = 0) = 0 and $ (hcjj =0)=0 (14) 

defining respectively the critical misfit / c and the critical thickness he. $ is the 
corresponding function in Eqs. (13). 

The. fact that in the ID FK model the density ] of MD rises abruptly and 
closes rapidly in on / at the C- IC transition was taken as ground to introduce 
the approximation (25) 

E- e (h,f c ;e = f) « EJh,f c ;] = f) (15) 

However, this equation overestimates (12) f c by up to 30%. 

The adoption of equilibrium criteria implies that MD are freely available, 



which they are not because of the existence of energy barriers (5,6,11). In 
analogy to (13b) the energy barriers to nucleation and motion may be obtained 
by integration as 

U &c = (Fds. (16) 

•'bar __ 

When reduced by Peach- Koehler forces to about 30 kT and less, the barriers can 
be overcome by thermal fluctuations, given enough time. When reduced to zero 

U= 0 (17) 

the nucleation or motion is spontaneous. In the sections below we briefly 
describe calculations of / c and he with the view of illustrating the foregoing 
considerations. The illustrations are not exhaustive. Equilibrium predictions 
using Eq. (13c) will be referred to as force criteria, and energy criteria otherwise 
(5,6). When applicable force criteria are superior in simplicity. 


EQUILIBRIUM CALCULATIONS 

Although the Frank and van der Merwe theory (21) based on the ID FK 
model, lacks accuracy when applied to real systems, it has the merits of 
simplicity and capability of making predictions about most of the features of 
present interest: a critical misfit 

A" =£, = »-!*»» ML, (18) 

obtained using either (13a) or (13b) and a critical thickness (see Eq. (14)) 

h^ 1) = rice = 8 VbaAV 1 *^/ 2 (19) 

In these equations the superscript (1) designates ID FK model. p (1) is the force 
constant of the springs modelling the harmonic forces (Fig. 2). Accordingly the 
ratio lo is a measure of the relative strength of the competing interactions; as 

expressed by p (1) and Vo. The authors estimated that /c 1> «9% and that f l c l) 

decreases in going to a 2D quadratic model to fi 2) -7%, due to the Poisson effect 
(31) in a model with a 2D quadratic interface. 

In 2D interfaces of the kind {110} fcc/{l 10} bcc (8,32) the misfits f x and / y 
differ greatly and are most suitably expressed in terms of the ratio 

r = Onn/^nn (20) 

of nearest- neighbor distances a nn and b aa . Aspects of great significance in the 
Fourier description for these cases are (8,9): (i) that minima of interfacial energy 
exist for atomic (ideal- epitaxial) configurations in which sequences of atomic 


rows on either side of the interface come into ID registry (coherency, i.e. wave 
vectors on either side of the interface match), (ii) that the matching and depth 
are determined by the wavelength of the Fourier term and the magnitude of its 
coefficient respectively, (iii) that the depth of the minimum is a measure of the 
epitaxial tendency of the related epitaxial configuration (dimensionally and 
orientationally) and accordingly provides the driving force for MS towards 
coherency, and (iv) the depths are additive when 2D registry is attained. "Phase 
diagrams" defining the critical values r xc , r yc and r c have been constructed (8,9). 
In these, in which r xc and r« c mark (19,20,21) ID coherency breakdown the 
critical thickness is also defined approximately as in Eq. (19). 

The main consequence of including anharmonicity (27) is that the critical 
parameter values branch into two: the lower branch for positive natural misfit 
(coherent overlayer compressed) and upper one for negative natural misfit. The 
magnitudes of the gaps depend on the details of anharmonicity and the misfit in 
relation to the point of inflection of the nearest- neighbor (nn) pair interaction. 
The "rupture" and "distortion modes of MA obtain when the nn bond is 
stretched beyond its theoretical strength (see Eqs. 1(f) and 3(a)). 

An important generalization of the foregoing is the one in which there exists a 
structural transition of the crystal to a nearby metastable phase (28,29) that can 

be reached by homogeneous (misfit) strain e, for example, the tetragonal 
distortion of a bcc phase into a metastable fee phase. There exist critical 
parameters for both the stable and metastable configurations as well as two 
points of inflection of the free energy vs strain curve. If the critical misfit in the 
stable ML configuration exceeds the first point of inflection and the natural 
misfit closely matches the tetragonal strain corresponding to the metastable 
configuration, the "critical" thickness (thickness to which the metastable 
configuration persists) will be very much higher than would be expected on basis 
of the harmonic (approximation) calculation, assuming that the existing misfit be 
accommodated by MS (referred to the stable phase) alone. 

The model on which Eq. (19) (and its 2D extension) for the critical thickness 
is based, suffers from the deficiencies as listed above. Equally important 
deficiencies are that the strain gradients in the overlayer normal to the interface, 
have been neglected (8,9,19,20). Previous attempts at including these have all 
been approximate. Exact solutions (12,26) had only been obtained by using a 
parabolic (approximate) potential of the form in Eq. (6). The resulting 
expressions are complicated. For the case of a single multilayer on a thick 
substrate (both isotropic and with equal elastic moduli p and v, including the 
interface) the critical misfit is given by (26) 


f c (h) ~ ITjfcj/o xlx+U'MJ’ a(h ) = 2( \ V)h ’ ( 21 ) 

where h = and $ is a complicated function of x and o, that increases in 

complication when the moduli are different. The Volterra approach instead 
yields (11) the results (see Eqs. (7), (9), (13)) 



( 22 ) 


fc(h) - fjrgfi{tn(hA)+l]> he - pgj{t n (h cA) + * 7 - 

These are evidently simpler than Eq. (21) to analyse, but rather inaccurate at 
small thicknesses because of the approximations involved in fcqs. (9). Note that 
in Eq. (22) (and also in (24) below) h = h A falls in the class h A < ip of Eq. (9b). 

The parabolic model has also been used to obtain (exact solutions) critical 
parameters (12) e.g. for a superlattice obtained by stacking A and B layers in 
alternating succession, giving 

(l-v n )Pp> n -oo , 


where P is defined in Eq. (2b), o(/i B ) = 2(1- M and $ is a complicated 

function of elastic constants and thicknesses, which simplifies greatly when these 
quantities are respectively equal. Of interest is that hc/d varies approximately as 
/•* with k » 1.22 as compared to k = 2 in Eq. (19) and k « 1 in Ea. (22). Energy 
criteria were also used in conjunction with the relations (7) and (8) to calculate 
approximate values for (i) the critical thickness in an epitaxial bicrystal of two 
equally thick halfcrystals (33) and (ii) the critical size Ac of a hemispherical 
island on an extended 3D substrate crystal with similar elastic constants (34). 
The results in (i) correlated excellently with the observed critical thickness in 
PbS/PbSe epitaxial bilayers (35). The results in (ii) were qualitatively correct 
(34). 


The' power of the force criterion in Eq. (13) (combined with the Volterra 
model) to handle rather complicated cases is demonstrated by the derivation of 
the critical layer thickness (11) 


l Dbf 1-ucos 2 a)fln (h c /b)+ 1/ + oob sin a 

nc= m /cos A - jo /cos $ 


(24) 


in which the C- IC transition was accomplished by imperfect partial dislocations. 
In relation (24) A is the anele between the slip direction and the interfacial 
direction which is normal to the line of intersection of the slip plane (S) and the 
interface, a is the angle between the line of the mixed MD and its Burgers vector, 

and $ is the angle between the unit vector n, normal to S and the line of 
intersection of the free surface and a plane which is normal to the free surface and 

contains n. Also, % and <to are respectively the stacking fault energy and the 
surface free energy of the overlayer surface where the gliding TD traces a step of 
height ft* sin o. A relation analogous to (24) can be written down for a MD that 
is generated by climb (11). The force criterion was also applied to superlattices 
by Hirth and Evans (36) and to multilayers, (allowing for bending) by Bokii and 
Kuznetsov (37) with satisfactory results. 



NON- EQUILIBRIUM CALCULATIONS 


We have classified as non- equilibrium (5,6) all calculations (i) of energy 
bd,i riers to the acquisition (nucleation well as motion) of MDa, (ii) of barrier" 
reduction and elimination by Peach- Koehler forces and (iii) of the dynamics *»f 
the acquisition processes, e.g. dynamics of MS relief. 

The Frank- van der Merwe (21) analysis of the ID FK model is also verv 
powerful in illustrating non- equilibrium considerations, particularly features (i) 
and (ii) listed above. The authors considered the introduction (nucleation) of a 
MD from the free end of of the chain by displacing the endpoint by an amount 


a(, obtaining for a coherent layer U((,f) and U^c(f) 

U(i (,f) = (2 Volo/*)[ ( cos*(o-hcosx()+fto( -*0+2k], (25a) 

V K (f) = (4 - ft„ arccostflo)], (25b) 

where ^,2 = 1,0 and Si, 2 = - 1,1 when £ < 1 and f > 1 respectively. The 
displacement of 0 of the free end is defined by 

sin jrfo = flo- (26) 


A MD is created when the endpoint is displaced from the stable equilibrium 
position f = fo to the nearest stable equilibrium position £ = l+£o- When / = 0 
it follows from Eq. (26) that £0 = 0, defining (see Eq. (25a)) the nucleation 
barrier at zero misfit as 

Ua,0) = (2Volo/*)(l- cosjV . (27) 

The residual part of U((,f) constitutes the reduction of the barrier by the 
Peach- Koehler MS forces. These considerations are illustrated in Fig. 4. 
Though Fig. 4 corresponds to a misfit / = f c (= equilibrium critical misfit 2/ Wo) 
the acquisition of MDs is still confronted with a nucleation energy barrier. 
Curves A, B and C display respectively the natural barrier U((,0), the details of 
the reduction of the barrier by the Peach- Koehler forces and the net barrier 
U((,f c )- The barrier height attains the values 4 WIq/t, 1.6 IVo/* (approx.) and 
0 when / = 0, / c and f s respectively. At 

or (28) 

the barrier vanishes and MDs enter spontaneously (without thermal aid). For 
metals is of the order of 1.0 eV so that even at / = f c the activation 

energy barrier is still about 1.0 eV per atom length of MD. The nucleation 
barrier for a finite (critical) length of MD may thus still be above the common 
thermal threshold of about 30kT. It thus seems likely that "easy" sources of 
MDs are normally needed for the C- IC transition, for example threading 
dislocations. The energy barriers to the nucleation of MDs at the crystal surface 
using continuum theory and in the interior of a crystal with MS, using ab initio 
(16) calculations had also been carried out with equivalent conclusions. The 
foregoing considerations are helpful in interpreting recent results obtained by 



Tsao et al (16) regarding MS relief in Si x Gei- x strained layers grown on (001) Ge 
substrates. The authors have compared the components F- pQJ . of the 

Peach- Koehler MS and the dislocation (Dresumably TDs) line tension Fq 

forces with the observed MS relief and concluded that "mstastable strained layer 

breakdown inmost directly correlated with (i) an excess stress 

Vc = 2 < F i, par" F DpaM b ^ (“) absolute temperature T\ When written 

as an excess force 

F exc ~ F e,par * F D,par 

the result represents the non- equilibrium (Ft 0) equivalent of relation (13c) and 
may be correlated with the reduction of the activation barrier as described by 
Eqs. (25a) and (25b), and displayed in Fig. 4. Although Eqs. (29) and (25), refer 
to different cases, the former to the motion of a TD and the latter to the 
nucleation of a MD, the principles involved are the same. The relation Fexc = 0 
still describes equilibrium as in Eq. (13c). When Fexc f 0? the related barriers 
delay MS relief which only begins to occur with an observable rate at an excess 
stress Tex c/p » 0.024 that has reduced the Peierls barrier of a critical length of 

TD to within the thermal threshold at 494°C. 

A very significant mechanism of reducing the energy barrier to the acquisition 
of MDs, and to tailoring them, has been discovered by Fischer et al (39) for the 
growth of GaAs on (100) Si substrates. The authors had cut the (100) Si surface 
off axis (tilted optimally by about 4° towards [001]) so that the exposed Si 

surface forms a 2D staircase with ledges along [Oil] and [Oil]. The ledges 
apparently facilitate the recruitment of substrate dislocations that terminate at 
the Si surface, drawing them into the ledges where they become fairly tightly 
bound EMDs that are prevented from threading the GaAs overlayer. 

Dodson and Tsao (17) have also analysed the dynamics of MD acquisition 
using a phenomenological model and making various assumptions of which the 
most prominent one is that the dislocations are driven by a local stress rioc that 

can be expressed in terms of a residual mismatch / - J(t) - e(h), in which e{h) is 

the equilibrium MS at thickness h and the MD density ]{t) at time t falls short 
in accommodating the residual mismatch. The authors derived the equation 

ij/dt = mi - ho - i(k)M - j„) po) 

for the dynamics of MS relief; C being a constant and Jo a background dislocation 
density. Equation (30) gave a satisfactory description for the observed MS relief 

of SiGe alloys on (100) Si substrates with values of C and Jo given by C = 30.1 

and Jo = 10* 4 . 



SUMMARY AND CONCLUSIONS 


The main objectives of this paper have been (i) to correlate the principles, 
specifically equilibi ium/non- equilibrium principles on which critical 
misfit/critical thickness calculations are based, with the experimental conditions, 
under which they are measured, (ii) to describe the models used in the 
calculations, (iii) to highlight the deficiencies, advantages and disadvantages of 
the various approaches, and (iv) to briefly describe sample calculations, making 
no attempt to be exhaustive. We have given prominence to the prediction of 
critical parameters in layer growth, an approach which we justified on the 
grounds that either an overlayer has a natural tendency to grow in the 
ML- by- ML mode because of strong adatom- substrate bonding and subsequent 
homoepitaxial growth, or the overlayer can be coerced into growing layerlike by a 
suitable supersaturation. In a C- IC transition the coherency breakdown is 
accomplished by the acquisition of MDs, a process which is opposed and retarded 
by a hierarchy of energy barriers, some of which naturally fall within the thermal 
threshold and others which need be reduced, mainly by Peach- Koehler forces to 
come within this threshold. It is therefore not suprising that measured critical 
values are scattered over regimes covering the entire range from non- equilibrium 
to equilibrium values. In metals where the Peierls barriers to dislocation glide 
are believed to be relatively small many observations correlate with equilibrium 
predictions whereas in semiconductors with strong localized covalent bonding the 
barriers are high and many observations cannot be correlated with equilibrium 
calculations (5,6,11). Because of their relative simplicity most calculations of the 
past have been equilibriumwise. However, recently there had been a growing 
interest and activity in the non- equilibrium regime. This is very timely 
particularly because of the technological importance of semiconducting materials. 

As to models, the phenomenological models of Volterra and of Frenkel and 
Kontorowa have been the main goalls of the past and will still play a significant 
role in the future because of their simplicity and greater analytical predictive 
power. Ab initio calculations, rightly aimed at greater accuracy, will be carried 
out with increased frequency. 

The Volterra model has the advantage of greater simplicity, particularly in 
allowing for varying layer thickness, whereas the FK- model is more accurate at 
small (l- 2 MLs) thicknesses and in addition is very powerful in selecting 
epitaxial orientations for which the coherency lock- in power is maximal. 
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Fig. 1. Misfit accommodation modes: (a) misfit vernier, (b) misfit strain, (0 
efficient MD, (d) inefficient MD, (e) misfit- Taylor dislocation i.e. MD PQ 
(between a monolaver ML and a substrate) ending in a Taylor dislocation at Q 
within the ML, (f): (i) a coherent chain of particles, (ii) a chain with a>b 
containing "ruptures" (broken segments), (iii) a chain with a<b containing 
"distortions", (g) misfit curvature and (h) a misfit strain gradient. 
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Abbreviations 

AAI adatom-adatom interaction 
ASI adatom-substrate interaction 

C-1C commensurate-incommensurate, coherent-incoherent 

FK Frenkel-Kontorova 

IMD inefficient (imperfect) misfit dislocation 

KS Kurdjumov-Sachs 

NW Nishiyama-Wassermann 

MA, MC, MD, MF, MS, MSG and MV Misfit accommodation, curvature, 
dislocation, fracture, strain, strain gradient and vernier 
ML monolayer 
TD threading dislocation 
VM Voherra model 


1. Introduction 
1.1. Misfit 

The epitaxial growth of one crystal on another has been studied as a fundamental 
problem for over six decades. Only in the past two decades, however, has there 
existed a practical interest in understanding the phenomena more completely. 
This recent practical interest has arisen out of the great need of the semiconductor 
industry for crystals free from dislocations and other defects. As the use of 
electronic devices became more demanding and sophisticated, the need for 
perfection of the crystals comprising the devices has increased. This is particularly 
true for superlattice structures. 

Central to the growth of nearly perfect epitaxial overlayers is the concept of a 
coherent or dislocation free interface. The thickness above which dislocations 
appear in the interface during the growth of an epitaxial layer is known as the 
critical thickness. Because of the importance of this parameter to the fabrication 
of acceptable semiconductor devices, classification of the various approaches to its 
calculation is desirable. Before discussing the calculations of critical thickness, it is 
necessary to define the nature of the misfit associated with the interface between 
two different crystals. 

The general case of fitting together two crystals of different lattices in an 
oriented way across an arbitrary but fiat interface, was dealt with theoretically by 
Pond [1] and by Bilby et al. [2]. We follow the latter authors who introduced a 
geometrical closure failure B to a Burgers circuit enclosing a section of the 
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interface. The vector B varies with direction of the interface and may be written 
in terms of a reference vector V lying in the interface as 

B = [Sr' ~S;'\V, (1) 


where 5, and S 2 are the “deformation” matrices that generate respectively the 
lattices of crystal 1 and crystal 2 from a reference lattice in which B and V are 
defined. The general case that includes crystal symmetry and coincidence lattices 
can be incorporated in the use of eq. (1) to express the deviations from 
coincidence and symmetry positions. In this chapter we are concerned with cases 
in which the crystals are simply related, to one another through the matrix S of 
small deformations. In this case the closure failure is represented by a set of 
dislocations of sufficient separation to be considered individually. When V cuts m i 
dislocation axes of Burgers vectors b t then 


B = X m i b i , 


( 2 ) 


and the average spacing between like dislocations of given type i measured along 
V is Vm~\ It is convenient to express the closure failure in terms of three 
physically distinguishable components [3]: B t is the component normal to the 
interface and represents the component of B produced by tilt around V y where V 
is the unit vector parallel to V; is in the plane of the interface and normal to V 
and represents the twist or shear component; and lastly B { is parallel to V. The 
component B { is the misfit component which is of interest to the present chapter, 
and it is this component that provides a definition of misfit / as measured along 
the direction V of the interface as 

/= y = ■ (3) 


When the deformations 5 are small and contain no rotation terms a pure misfit 
boundary between two parallel orthorhombic crystals results and typically sug- 
gests one of two choices for the reference lattice. When the two crystals across the 
interface are of comparable thickness, or are both “thick”, then the choice of a 
reference lattice that has lattice constants between those of either crystal is 
appropriate. According to a simplified treatment of the closure failure [3] one of 
the reference lattice constants, say A is chosen to be 

c = a x a 2 la 2 , (4a) 

where a, and a 2 are the lattice constants in a chosen principal direction a of the 
respective crystals 1 and 2 and 

a = \(a x + a 2 ) (4b) 

is the average lattice constant. This choice of reference lattice constant is also the 
one adopted by van der Merwe in his model for thick crystals [4], Equation (3) in 
conjunction with eqs. (1) and (4a) and (4b) yields for this case 

= 2 ~ V c: similar thickness , (4c) 

as the misfit in the a-direction of the lattices. 
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The second choice of reference lattice is appropriate to a thin crystal on a thick 
substrate. In this case it is natural to take the substrate as the reference crystal. If 
we call the reference crystal 1 then S~ ] = /, the identity matrix, and hence eq. (1) 
becomes 

f» = [/-5J , ]V. (5a) 

which is equivalent to the approach of Bollmann [5]. The misfit now, is by eqs. 
(3) and (5a), 

A = 0*2 “ & i ) !a z • thin overlayer . (5b) 

The advantages of definitions (4c) and (5b) have been discussed by Matthews [6]. 

Epitaxy of crystals with different symmetries is also of interest. A case that had 
been considered extensively is the epitaxy of metals at (111) fcc/(110) bcc 
interfaces [7, 8] in which two dominant epitaxial orientations (the Kurdiumov- 
Sachs and the Nishiyama-Wassermann orientations) differing by about j^occur 
depending on the ratio of nearest-neighbor distances. The two misfits and / 2 , of 
the form (4c) or (5b), are usually defined for two suitable orthogonal interfacial 
directions, /, and /, being significantly different. 


1.2. Modes of misfit accommodation 

The above considerations are purely geometrical. The special response of the near 
interface atoms to the fields AA and BB within each of two misfitting epitaxial 
crystals A and B and the coupling field AB between them is normally referred to 
as “misfit accommodation'' (MA). In the foregoing we have tacitly assumed that 
the misfit / is accommodated by dislocations, so-called misfit dislocations (MD’s). 
However, this is not the only mode of MA. The resulting mode in any given case 
depends largely on the magnitude and nature of / as well as those of the fields. 
The possibilities considered are illustrated in fig. 1. 

Consider a pure misfit boundary, i.e. S 1 and S 2 are diagonal matrices that 
represent small dimensional changes, not rotations. There are numerous modes 
by which the misfit of this interface can be accommodated. These mechanisms can 
operate individually as a single mode or in concert with several modes participat- 
ing. Homogeneous elastic strains that reduce the effective misfit from / to zero 
(fig. lb), i.e. B— >0, have first been observed in metals [9-13) as reviewed by 
Matthews [14] and are now commonly observed in strained layer superlattices of 
semiconductor crystals [15]. This strain is referred to as the misfit strain (MS) 
[16-17], and interfaces for which the misfit is accommodated entirely by MS are 
said to be pseudomorphic or coherent [18]. Similarly, free-standing crystals may 
bend with neutral planes that are positioned so that the misfit at the interface is 
accommodated by the lattice parameter gradient introduced through lattice 
curvature [19-22]. We can refer to this mechanism (fig. lc) as misfit curvature 
(MC). A third strain mechanism for accommodating the misfit also exists. It is a 
simple gradient of strain (misfit strain gradient: MSG) that takes a coherent 
interface from its constrained lattice parameter at the interface to its bulk lattice 



4 Ifb 


W. A. Jesser and J. H van der Me me 


Ch. 41 






Fig. 1. Diagrammatic illustration of modes of Misfit Accommodation (MA) in a bicrystal of crystals A 
and 8 with natural lattice parameters a and b. The full lines designate atomic planes and the dashed 
lines interfaces, ail viewed edge on: (a) Crystals as though rigid, or with vanishing interfacial 
interaction, or both; Misfit Vernier mode of MA. (b) Crystal A homogeneously strained (a— *■ a) 

into registry (d = b) with B\ Misfit Strain (MS) mode of MA. (c) Crystals bent into registry at 
interface; Misfit Curvature (MC) mode of MA. (d) Misfit accommodated by a Misfit Strain Gradient 
(MSG) within a finite thickness, (e) Crystals elastically relaxed so that both the residual interfaciai 
misfit and the strain are very much localized around regularly spaced (dislocation) lines; Misfit 
Dislocation (MD) model of MA. (f) A MD formed from a glide dislocation on a slip plane S has its 
Burgers vector inclined to the interface and constitutaran Inefficient Misfit Dislocation (IMD). 

parameter at the free surface [23] as in fig. Id. This can occur when the height to 
width ratio for a deposit is comparable to the mutual misfit. More precisely, since 
the MSG is not expected to be linear but to decay rather faster than linearly with 
distance normal to the interface, there is an effective thickness above which the 
lattice parameter is essentially that of the bulk. It is this feature which means in 
practice that there is no critical thickness (or the critical thickness is infinite), but 
there is a critical misfit which decreases with increasing width. Heterostructures 
involved in ultralarge scale integration can take advantage of this misfit accommo- 
dation mode. Recent evidence of misfit dislocations between the reconstructed 
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topmost surface of a small gold platelet and its bulk underlying layers [24,25] 
suggests that the MSG mode of accommodating misfit may not always be 
energetically favored. 

When the interfacial bond is weak and the misfit large [8], the structure of the 
interface (fig. la) resembles the rigid misfit vernier (MV). This mode of misfit 
accommodation or one approximating it has been observed in some metal-non 
metal epitaxial systems [26]. 

An uncommon mode of misfit accommodation is fracture (misfit fracture: MF). 
If the interfacial bond is strong and the ep itaxial layer brittle, then a ten sile misfit 
strain can be relieved by a crack on the fTearest/ cleavage plane that ^normal to 
the interface. Epitaxial garnet films show this mode of misfit accommodation [27]. 

The most common mode of accommodating misfit is by misfit dislocations 
(MD’s; fig. le). A MD is defined as an interfacial dislocation that generates the 
transition between the two lattices and is said to accommodate the misfit /. 
Essentially it replaces the long-range stresses due to the MS by the oscillatory 
strains of the MD’s to reduce the overall energy. There are several choices of MD 
arrays that will accommodate the same misfit. Minimum energy considerations 
coupled with barriers to MD generation determine which of the available choices 
will be realized. The most common MD’s have an edge component of the Burgers 
vector b that projects along the direction of interfacial misfit. The most efficient 
MD has its Burgers vector in the interface and is therefore one of pure edge 
character. It is referred to as an efficient or perfect MD, and was first observed by 
TEM in PbS/PbSe bicrvstals [28]. It is geometrically possible to accommodate a 
pure misfit boundary of (001) orientations between orthorhombic crystals with a 
crossed grid of screw dislocations [29] if the crystals are rotated so that the a-axis 
of the one is parallel to the fi-axis of the other and the misfits in the two directions 
are therefore equal but of opposite sign. This same misfit boundary can also be 
accommodated by a crossed grid of perfect MD’s. While the above considerations 
show that MD’s are not restricted to any particular type of dislocation one usually 
expects MD’s to have edge character. 

The first experimental evidence for misfit accommodation by MD’s was ob- 
tained from etch-pit analyses of germanium crystals with small gradients in 
chemical composition [30]. Examples of misfit accommodation by only inefficient 
(or imperfect) misfit dislocations (IMD) (Burgers vectors lying out of the 
interface as shown in (fig. If) can also be found [31]. It is also possible to have 
both inefficient and pure misfit dislocations present concurrently in the same 
interface [32]. Indeed, accommodation of misfit by dislocation half-loops and 
inefficient dislocations that have glided into the interface has been documented 
[33-35]. Clearly then, the mechanisms available for the introduction of the 
various types of MD’s is of significant practical importance. It is now commonly 
observed that MD’s occur in most epitaxial systems, as has been reviewed by 
Matthews [6, 14]. 

The last mechanism for accommodating misfit to be discussed here is that of 
ledges. Often substrates are oriented off major crystallographic planes so that 
steps in the surface are available for nucleation sites. Further, the appearance of 
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growth steps and ledges at interphase boundaries is common [36]. The occurrence 
of stepped interfaces to improve the matching of interphase boundaries with misfit 
has been known for some time [37-40]. The suggestion that steps associated with 
terraces may accommodate misfit at an epitaxial boundary has also been made 
[41]. It is only recently that the energy of a misfit boundary with ledges has been 
calculated [42]. A planar misfit boundary may improve the overall interfacial 
matching if a sequence of steps, that effects a relative displacement of the atomic 
patterns on either side of the interface so as to insure that disregistry is never 
greater than about one quarter of an atomic spacing, is introduced [42, 43]. A 
detailed treatment of this problem is in progress [44]. 


1.3. Goals 


The main goal of the present paper concerns the accommodation of misfit by 
MD’s for the case of a pure, flat misfit boundary. It is the MD’s that are the 
center of attention because of their importance to electronic device performance 
and reliability. If MD’s and MS(e) coexist, the portion of the misfit / accommo- 
dated by MD’s being /, one may write for each interfacial direction of a thin 
epilayer A on a thick substrate B (to first order) [14] 

/ = /+ e ; e s \e\ (6a) 


the corresponding quantities in the two directions being equal when the interfacial 
symmetries are quadratic, but otherwise unequal. In the Frank and van der 
Merwe formalism [46] 


/ = 


a - b 
b ’ 




(6b) 


where a and b are the normal atomic spacings in A and B and a is the misfit 
strained spacing in A; B, being thick, is unstrained. When both crystals are thin 
provision must be made for MS in both crystals. Furthermore, coherence may 
exist in one direction and disregistry with MD’s or a MV in a perpendicular 
direction [8]. An implicit assumption, also in eq. (6a), is that the temperature 
remains fixed at the growth temperature, or the half crystals A and B have the 
same thermal expansion coefficients. Otherwise the equilibrium MS e may 
decrease, or increase or even change sign. Technologically this is undesirable and 
provision must be made to fit both crystallographical misfit and thermal expansion 
coefficient [45], 

The goal has thus been reduced to describing existing considerations on the 
conditions that determine the distribution of /between / and e. Here it suffices to 
report [46] that under certain conditions there exists a critical misfit f[ l) so that a 
monolayer (ML) with /</ [ l) grows coherently (/ = 0) while MD’s are present 
when / > f { c l) . If for / </ ( c n more ML’s are added a critical thickness h c is reached 
above which there is a transition to MA with MS and MD’s jointly. Conversely 
for given h-h c the misfit / is critical, i.e. for / > f(h c ) = f c (h) there will be a 
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transition from coherency (registry, commensurate configuration) to incoherency 
(disregistry, incommensurate configuration). The goals of this chapter are to 
review and appraise existing calculations of f c and h c with special emphasis on 
their dependence on the physical properties of the bicrystals concerned and to 
their correlation with experimental data - particularly the influence of the ex- 
perimental conditions on this correlation. 


1.4. Governing physical principles 

The systems (vapors, crystals) and processes (adsorption, desorption, adatom 
diffusion, MD generation and motion) involved in epitaxy fall respectively in the 
categories of thermodynamic systems and rate processes. Therefore at an inter- 
mediate stage any internal change under constant external constraints brings the 
system nearer to the equilibrium (stable) minimum free energy (A) configuration. 
The process is driven by free energy gradients - A A and progress is facilitated by 
thermal (temperature T) fluctuations to overcome a hierarchy of energy barriers 
U (e.g. adatom migration activation energy, and energy barriers towards nuclea- 
tion and motion (Peierls) of dislocations) separating consecutive metastable 
configurations. Thus the closeness of an observed configuration to equilibrium or 
the degree to which it had advanced towards equilibrium, will otherwise depend 
on the deposition rate (supply rate of adsorbate). 

Since the analysis of equilibrium configurations on the basis of minimum free 
energy, i.e. 

A = minimum (equilibrium) , (7) 

is so much easier than the dynamics of non-equilibrium processes, most theoreti- 
cal predictions of the past were equilibrium based, the assumption being that in 
many practical cases sufficient time had been available for the system to come 
close to equilibrium. The perception is that knowledge of equilibrium configura- 
tions will also provide useful guidelines of tendencies. Additional simplification 
had been introduced by noting that, because of the regularity of the crystals and 
of the MD arrangements adopted, the energy (£) contribution to A is dominant 
and hence that the equilibrium configurations may be adequately characterized by 
the condition that (6, 8, 46] 

£ = minimum (equilibrium) , (8) 

Clearly the applicability of the theory requires the barriers to be small enough as 
compared to thermal energies. 

In the problem of epitaxy the scale of £, which has often been referred to as 
the interfacial energy, is primarily determined by the strength of the AB 
interfacial interaction and secondarily by the misfit. The primary goal has 
therefore been to find the dependence of £ on these parameters. The basic 
assumption is then that the structure (mode of MA) is determined by the minima 
of £ with respect to these parameters. 


W. A. Jesser and J. H. van der Merwe 


Ch. 41 


mp'lo 

1.5. Growth modes 

Of great importance to the consideration of critical size, e.g. critical thickness, are 
the epitaxial growth modes that have been named by Bauer [47]: Frank and van 
der Merwe (ML-by-ML, 2D) growth, Volmer-Weber (island, 3D) growth and 
Stranski-Krastanov (island on top of a few MLs in 2D mode). Also the growth 
mode is partly controlled by the prevailing degree of non-equilibrium as expressed 
in terms of supersaturation. The supersaturation increases with decreasing sub- 
strate temperature T s and increasing deposition rate. The equilibrium criterion for 
the growth mode can be expressed in terms of the specific surface (y A , y B ) and 
interface (y AB ) free energies [47]. or bondings [48] per unit area E AA and £ AB as 

A Tab = Ta + Tab - Tb « 0 or £ AA “ ^ab « 0 for 2D growth 
>0 > 0 for 3D growth , 

where B is the substrate and A the adsorbate. For growth of the (n + l)-st ML on 
the n-th ML, Ay AB must be replaced by Ay AA — 0. However, Ay AA may even 
become effectively positive due to influence of the nearby substrate. This is an 
important reason for Stranski-Krastanov growth [49]. Equations (9) are simply 
saying that when AB bonding is strong compared to AA bonding the tendency 
will be to form AB rather than AA bonds, i.e. lateral rather than vertical growth. 

The formation of smooth and planar interfaces and of uniform epilayers are 
highly desirable, particularly in superlattices consisting of thin alternating A and 
B layers, and is facilitated by 2D growth [48,49]. In this case one half of the 
growth cycle will inevitably be 3D-wise at equilibrium. 3D growth can be modified 
to occur 2D-like by having the appropriate supersaturation. The smoothness of 
the interfaces will furthermore be facilitated by a choice of materials for which the 
degree of three-dimensionality is low [48] as may approximately be expressed in 
terms of a surface free energy “mismatch” [49]. Since most practical applications 
require smooth planar interfaces and uniform thicknesses, the fabrication pro- 
cesses are tailored to achieve these goals. This lends more credibility towards the 
usual practice of predicting critical misfits assuming layerlike growth. 


1.6. Models 

Almost all the earlier theories modeled the interaction (A A and BB) within the 
crystals in terms of continuum elasticity (harmonic approximation) theory 
[4,6,46]. Subsequently anharmonicity has been introduced [50,51], and more 
recently ab initio atomic calculations, using appropriate potentials, have also been 
carried out [52]. 

The earlier theories adopted essentially two approaches for modeling the 
interfacial (AB) interaction: the Volterra continuum model in which A and B 
interfacial atoms undergo relative displacements parallel to the interface that are 
limited to integer values of the (reference) lattice parameter, and the Frenkel- 
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Kontorova (FK) model [53] that views the interfacial atoms of A to be in a 
(sinusoidal) periodic field of crystal B and vice versa. The Volterra model (VM) 
which has been extensively and profitably employed by Matthews [6, 14] has the 
advantage of versatility and mathematical simplicity over ail regimes of thickness. 
Its main disadvantages are (i) its inaccuracy at small thicknesses and (ii) that it 
does not allow for the widely varying interfacial (AB) bond strengths which are 
important for thin epicrystals. The FK-model on the other hand makes appropri- 
ate allowance for AB interaction and has fairly simple analytical solutions for the 
extreme cases: (a) a ML on a thick substrate for which the governing equations 
reduce to sine-Gordon types [46, 53-55] and (b) two thick crystals [4, 56], but has 
to resort to approximations for intermediate cases except when the periodic AB 
interaction is represented by parabolic arcs [56-57]. 

The model introduced by Frenkel and Kontorova is one-dimensional and was 
first applied to epitaxy by Frank and van der Merwe [46] in their classic paper 
concerning misfitting monolayers on thick (essentially rigid) substrates. It was 
subsequently extended to two interfacial directions [8,46,54,55]. As in the 
FK-model the complexity of the analysis was limited by truncating the corre- 
sponding two-dimensional Fourier series of the periodic interfacial (AB) potential 
at low harmonic order, a practice which has now been properly justified [58]. This 
extended model has been used to show [8] (i) that minima in the (free) energy of 
an epitaxial bicrystal occur when the interfacial structure and dimensions, and the 
relative crystal orientations, are such that parallel interfacial atomic rows of the 
two crystals are equally spaced, irrespective of whether in the rows the atoms are 
in registry, (ii) that the depth of the free-energy minima may be directly 
correlated with the values of specific Fourier coefficients and (iii) that the depths 
are additive should matching also occur in a perpendicular direction to achieve 
complete 2D registry. Thus, while the values of the Fourier coefficients are 
indicative of the maximal energies that can be gained (the tendency to epitaxy) 
from row matching in one or two dimensions, the matching itself predicts 
precisely for given crystal structures the orientations for which epitaxy is ob- 
served. Naturally the net gain will be reduced by strain energy if the matching is 
achieved by MS. 

The difficult part of the relevant analysis is to find expressions for the self 
energies (line tension) E D per unit length of a MD. In the FK model and its 
generalizations the analytical techniques have been different for a ML on a thick 
substrate and for two thick crystals and approximate otherwise. Because of its 
frequent use it is convenient for later reference to give the expression of £ D for 
two thick epicrystals [4,56]: 


£ d = 


VabCP 


P = 


47 T 
27rA 0 c 
PabP 


[1 + /J - (1 + p 2 )' 12 - p ln{2j3(l + p 2 )' 12 

1 _ 1 - «/, { 1 - ^ 

M„ Mb 


2 P 2 }}, 


(10a) 

(10b) 


where n ah , n a and fi b are respectively shear moduli for the interface and crystals 
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A and B , v a and v h Poisson's ratios, c ** *(a + b) \ s a reference lattice parameter 
and p the MD spacing. When p is large (dp< 1) eq. (10a) becomes 




(10c) 


In the Volterra model the self energy per unit length of a MD between an epilayer 
of thickness h d and a thick substrate is given by [6. 14] 



D = A ( ,b/2ir , 


where R is the “cut-off radius” usually taken to be 


(11a) 


/? = 


pH 

K 


when /i a > p ! 2 , 
when h z ^ p ! 2 , 


(iib) 


and the Burgers vector b is essentially the c in eqs. (10). We note the close 
similarity between (10c) and (11a) with R replaced by p„ h plA-n\ u ~p/10 for 
crystals with similar atomic interactions instead of pi 2 as in (lib). 

This expression for £ D can be adopted for the line energy of a MD or used to 
calculate the energy per unit area, 

Eq — Pj^ E D + Py E D 

= 2 E D lp for quadratic symmetry , (12a) 

of an interface for a network of MD's assuming that the crossing energy is 
negligible [6, 14]. When the misfit is partly accommodated by MS(e) the MS 
energy, which is of the form [6, 54, 59] 

2//.(l+v) 

Ei = Bh^el = — — h a el per unit area . (12b) 

V a 

for interfaces with quadratic symmetry, must be included. For this symmetry the 
principal strains e are equal and the shear strain vanishes. When both crystals are 
thin each will be misfit strained and contains MS energy of the form (12b) 
[6,57,60], In any case the total energy is 

E = E d + Ei (12c) 

where (12a) and (12b) are special cases of E D and E i respectively. 


1.7. Criteria 

Various criteria had been used in the past to calculate the critical misfit f c and 
critical thickness h c . These have recently been analysed in some detail [61]. It is 
therefore necessary that we understand the basis of each and put them into 
perspective. We begin by considering equilibrium criteria. For this purpose we 
may display the functional dependence of the total energy of a thin epilayer A of 
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uniform thickness h on a thick substrate as £(/;, /; / or e). We may further 
minimize E by varying the MS(e) (or equivalently the MD density /in eqs. (6)) at 
constant h and /, obtaining from eq. (8) the governing equations [6,46] 


Q= ~dt = ~df' f0re = e - 3nd ( l3a ) 

This defines the equilibrium values e m and / m of e and /. 

The form (13a) of the equilibrium condition clearly impliesextensive interfaces 
with many MD's so that it is meaningful to speak of varying/ continuously rather 
than discretely. The second part of eq. (13a) lends itself to an interpretation that 
not only overcomes this difficulty but in some cases also leads to a drastic 
simplification of the analysis: if E does not change with/, it is saying that the work 
W done to create the MD vanishes, i,e. 




j 


Fds = 0 


for e = e m and ; 


(13b) 


for e 5 e m the work W $ 0 respectively. In eq. (13b) F is an external force needed 
to generate the dislocation and ds is the surface displacement, or F is the 
integrated force on an existing dislocation (extending from the interface on the 
free surface or other interface) from which the interfacial MD is created (drawn 
out along the interface) and ds is an infinitesimal displacement of the (existing) 
dislocation line. Both F and ds are measured in the direction of increased length 
of MD line laid along the intersection of the interface and the slip plane of the 
“threading" 1 segment of the crystal dislocation in the strained epilayer. It is the 
glide of this threading dislocation (TD) that creates increased length of MD line; 
the glide direction and direction of “increased' 1 MD length are the same. When F 
is constant (independent of the displacements) the equilibrium equation (13b) 
reduces to [6, 14] 

0=F=F,--F d , (13c) 

where F i is the MS-induced Peach-Koehler force on the TD and F D the resistance 
due to the MD line tension. F may also be an external force needed to inject a 
MD and ds correspondingly a displacement of the point(s) of application of F as 
in refs. [46, 54, 55]. 

The problems associated with the forces between MD’s in an interface of finite 
width between two cylindrical half crystals, their interplay with misfit and MS, 
and the mechanics of consecutive incoming MD’s, have been analysed in some 
detail by Nabarro [62]. These provide useful insights into the behavior of MD’s at 
the onset of coherency breakdown. 

Equations (13) are three equivalent criteria (except that (13c) is a special case) 
for the equilibrium distribution of given misfit /between MD’s (/) and MS ( e ) at 
given thickness h of epilayer. As h decreases, the MS energy for fixed e decreases; 
alternatively the MS e for fixed MS energy increases so that the density / of MD’s, 
needed to satisfy eq, (2), diminishes until it vanishes (/= 0) at a critical thickness 
h c [6, 14]. Conversely f c = / is the critical misfit at given thickness h = h c . The 
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conditions for the critical misfit f c at given h and the critical thickenss h c at given /, 
are respectively [6, 57] 

0) = 0 and *(/, /i c ; /= 0) = 0 , (14) 

where <f> is one of the functions of eqs. (13). 

Consider the introduction of MD's into an initially coherent epilayer. The line 
tension increases the energy according to £ D in eqs. (10) or (11) and the 
Peach-Koehler force decreases the MS energy E- in eq. (12b). Furthermore, in 
the FK-model, at equilibrium there are no MD's (/ = 0) below the critical misfit / c 
and many - almost enough to accommodate all (/ = /) the misfit - above f c [46]. 
On the basis of these considerations it has previously [6] been assumed that the 
condition for f c is approximately equivalent to the condition that / = f c when all 
(e =/) the MS energy E in eqs. (12b) can be traded for MD energy in eq. (12a) 
with / = / [56]: 

/.; e=f) ~ E D (h, / c ; / = /) , (15) 

instead of a configuration where e and / coexist. This approximation has sub- 
sequently been used by other authors, as dealt with below. Recent calculations 
[57] have however shown that this approximation is rather inaccurate and may 
overestimate the critical misfit by as much as 25% and the critical thickness by as 
much as 35%. 

The foregoing considerations apply to the case where the availability of MD’s is 
not an obstacle [6,46]. Normally the introduction of MD's is subjected to the 
overcoming of energy barriers [46] 

U= j Fds . (16) 

bar 

The most prominent barriers are associated with the nucleation of MD and the 
Peierls stresses opposing their motion [6, 14], The barrier heights are reduced by 
the misfit stress (Peach-Koehler forces). When U vanishes, i.e. 

U = 0, (17) 

MD’s generate spontaneously without the aid of temperature fluctuations [46]. 

When U > 0 the acquisition of MD’s depends upon the thermal energy kT. 
When U s 30 kT the barriers are completely prohibitive and the system will 
remain almost indefinitely in a metastable configuration. When U <30kT the 
attainment of equilibrium takes time and the observed configuration may differ 
significantly from equilibrium. 

2. Calculations-, equilibrium 

The criteria employed for calculating critical parameters as defined in eqs. 
( 14)— ( 17) may be broadly separated into equilibrium and non-equilibrium 
criteria. The equilibrium category can be subdivided into (i) energy criteria when 
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the energy values are compared as in eq. (15) or the energy is properly minimized 
as in eqs. (13), (ii) stress or force criteria as given in eq. (13c) and (iii) 
geometrical criteria where good "matching" is implied to be equivalent to low 
energy. The non-equilibrium category, which recently became the subject of 
intensive study, can be subdivided into calculations concerning (i) nucleation 
energies of MD's (ii) other energy barriers and (iii) the reduction of the 
activation energy by MS until it ultimately vanishes as in eq. (17) and MD's 
generate spontaneously, and (iv) the solution of the rate equation to establish the 
degree of progress at any time t towards achieving equilibrium. Often diverse 
approaches are encountered within each of these categories. 


2.1. Energy criteria 


2.1 A, Precipitates 

The first calculation of a crtical size was made by Nabarro for the case of a 
misfitting sphere in a crystalline matrix [63], Although this is not really an 
example of “epitaxy" it is directly related and. being the first, warrants some 
consideration. Nabarro essentially employed the criterion (15) and equated the 
elastic strain energy associated with the coherency strains, MS, to the rise in 
interfacial energy that occurs when coherency is destroyed. The interfacial energy 
was estimated by the energy required to melt a monolayer at the interface. The 
same model was employed by Jesser [64] but with a more realistic expression for 
the interfacial energy. The sum of the misfit strain energy and interfacial energy 
was minimized as in eq. (13a) to find the critical radius above which coherency 
loss is energetically favored. In this case the interfacial energy expression of van 
der Merwe was used. A similar model in which the interfacial energy is replaced 
by the energy of a single Volterra type prismatic misfit dislocation loop of radius r 
equal to that of the misfitting spherical precipitate (r p ) provides yet another 
expression for the critical radius r c [65]. The authors considered the nucleation of 
a prismatic dislocation loop within the precipitate, its abwth by climb and its 
ultimate positioning on an equatorial plane in a circle 6t radius infinitesimally 
smaller than r p . Its energy of formation E { ~ E ( - A £- is approximately given by 


E,= 


2(1 - v) 


(";H 


nr\Atibf 
l + 4p/3 K ' 


(18a) 


where E ( and AE f (e = f ) are respectively the (Volterra) self energy of the loop 
and the MS energy release in the precipitate and, /x, K, v, b and / respectively the 
shear modulus, the bulk modulus, Poisson’s ratio and the Burgers vector (all of 
the precipitate) and / the misfit. From (18a) two alternative critical radii may be 
deduced: 


6(1 + 4/n/3K) 
87r/(l - v) 


r 1 , 8r c i 

- In -rr + s ; 
s e~b J 


1 , 2 , 


(18b) 


the one ( s - I) when E { = 0, i.e. the formation becomes energetically favored (see 
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eq. 15), and the other (5 = 2) when dEJdr = 0, i.e. the precipitate can support an 
already existing loop at its surface. Note that eqs, (18b) are implicit equations in 
the unknowns r* ' = r . 


Symbol 

CA9 W, 


2.1.2. Layer-like growth 

The first calculation of critical misfit and thickness in epitaxy was made by Frank 
and van der Merwe [46] in their analysis of a one-dimensional Frenkel-Kontorova 
model [53]. In this model the competing forces positioning the adatoms were 
modeled as follows: the adatom-adatom interaction (AAI) that favors the 
crystalline structure of the overlaver by. harmonic (approximation) forces of force 
constant /i (1) and the periodic (periodicity b) adatom-substrate interaction (ASI) 
by a one-dimensional Fourier series (truncated at first order) of overall amplitude 

® H 

v= 50[1 -cos( 277 Jc/ft)]. 

Equilibrium considerations (eqs. 13a, b) yiel^a critical misfit f[ l) of 




«" b %=H (,) b 2 /2W, 


(19a) 

(19b) 


The appearance of ju (n and W 0 , as measures of AAI and ASI respectively, 
stresses the importance of interaction strengths in determining/.. 

These results can be extended to a thickening overlayer (crudely though) by 
assigning to a layer of multiplicity n the force constant Equations (19) in 

conjunction with (14) accordingly predict a critical thickness [46] 

h c = SW 0 l7r 2 fi n) bf 2 . (20) 



The authors estimated f[ l) to be about 9% when the AAI and the ASI's are the 
same. The extension of these considerations to non-equilibrium phenomena will 
be discussed in section 3. 

Although the foregoing model is one-dimensional it has proved to be quite 
successful and has been the basis and stimulus for numerous extensions to more 
general cases. The most obvious extension was to two-dimensional model mono- 
layers (ML's) [8,46,54], This extension introduced Poisson's ratio v A of the ML, 
whereby eqs. (19a) and (19b) became 

= 2/^ yrnr . (2ia) 

(f= (21b) 

for cases with quadratic interfacial symmetry, now being the shear modulus 
and h the thickness; h = a for a ML. Critical thicknesses as in eq. (20) may also 
be defined for this two-dimensional extension. The effect of Poisson's 
phenomenon is essentially to reduce the critical misfit; in the average case (AAI 
and ASI the same) from f[ l) = 9% to about 7%, with an analogous effect on 
critical thickness. 
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A relevant case that had been very topical recently is the epitaxy of metals at 
(111) fcc/(110) bcc interfaces (7, 8. 49. 55] where the misfit differs greatly in two 
mutually perpendicular directions and the occurrence of coherency in one direc- 
tion only is a natural consequence. Loss of coherency for this case has been 
studied extensively on the ML or quasi-ML level as for eqs. (21 ). Also in this case 
a “configurational” parameter € of the form in eq. (21b). containing the bond 
ratio txb 2 /( 1 - v)W {) and thickness h. together with the nearest-neighbor distance 
ratio r = a[fcc (lll)]/b[bcc (110)] rather than the misfits explicitly, enter natural- 
ly into the theory. When f is small enough a ML grows with 2D coherency for a 
wide range of r values. As h , and accordingly increases there is a critical 
thickness h c at which there is a transition to ID coherency in the Kurdjumov- 
Sachs (KS) orientation at a critical thickness h c KS when r is near 1.09 (small misfit 
f c K s ) or to ID coherency in the Nishiyama-Wassermann (NW) orientation at /i^ w 
when r is either near 0.94 or 1.15 (small misfits fL «,)• The transition to the KS 
orientation requires a rigidlike rotation of about y~ and its realization may thus 
be impeded. Furthermore, higher critical thicknesses for loss of the residual ID 
coherencies exist. These critical thicknesses are only known semi-quantitatively, 
like h c in eq. (20). In all these cases the Poisson effect is important, particularly in 
the MS energy. The role of Poisson’s ratio on critical thickness in a variety of 
cases has also been studied by Jesser and Kuhlmann-Wilsdorf [59]. 

The most useful extension to the considerations of Frank and van der Merwe 
was to thickening epilayers, including combinations with different elastic con- 
stants (56.57.61.66]. Additional improvements to the model have been made in 
the form of more accurate representations of the MD energy, as has recently been 
reviewed [67]. While the extensions of the model to very thin epilayers (less than 
about two ML’s) and to very thick ones are acceptable, it is poor in the regime 
exceeding two ML’s; particularly when the misfit is small. It is in this regime that 
the approximations employed in the calculations introduce significant effects 
because the energy of the MD’s in the interface is given by an expression that 
relies on the epilayer thickness being greater than half the spacing between MD’s. 
This limitation led Matthews to adopt a Volterra dislocation approach to calculate 
the energy of an array of MD’s [6, 14]. The Volterra model employs a “cut-off’ 
radius R to the dislocation strain field defined with regard to eqs. (11). Since near 
/ = / c the MD spacing becomes large the finite thickness h of the epilayer A must 
be taken as R. Minimization of the total energy in eq. (12c) yields for f c at given h 
(using the relations (14)) the result [6, 14] 



or, alternatively for the critical thickness h c at given misfit / the implicit equation 



(22b) 


for a bicrystal AB with quadratic interfacial symmetry. Note that relations (22) 
imply that v a = v b = v. Also no dependence on interfacial interaction is incorpo- 
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rated. This approach is good for thick epilayers but is inaccurate when the 
epilayer is thin because the role of the ASl is significant here, and the strain field 
of the MD can extend along the plane of the interface to greater distances than 
the cut-off radius, which is equal to the film thickness. However, it has the great 
advantage of simplicity in its representation. 

The calculation of critical thickness by energy minimization has also been 
carried out for partial MDs. Such MD's will have stacking faults associated with 
them. The stacking fault will extend through one of the crystals and will terminate 
at the MD in the interface. This case has been treated in the model of van der 
Merwe using eq. (10) modified for the incomplete Burgers vector [68], as well as 
from the Volterra dislocation-energy point of view using force-balance criteria 
[6, 14]. The model of van der Merwe makes provision for a variable interaction. 
The effect of generating incomplete MD s associated with stacking faults is to 
reduce the critical thickness. This can be understood as the result of the shorter 
length of Burgers’ vector reducing the energy of the MD’s to a greater extent than 
the stacking fault raises the energy. Negative stacking-fault energy and lattice 
parameter changes normal to the plane of the fault will also affect the critical 
thickness, but are usually not considered in the analysis. 

The question of critical thickness h c has also addressed more recently by 
Bruinsma Zangwill [69] using the Volterra approach to the energy of a MD but a 
different analytical technique. They included anisotropy effects, but made the 
simplfying approximation that there exists an “easy” direction for loss of coheren- 
cy by the introduction of a single sequence of MD's. One would expect this to 
occur for epitaxy at (111) fcc/(110) bcc interfaces characterized by widely 
different misfits in two perpendicular interfacial directions. The authors obtained, 
like Matthews, an implicit equation of the form (22b) but with somewhat different 
entries and give their (approximate) result as 

h c =Kjf, (23) 

where k 4 is a “constant” depending on elastic constants as in eq. (22b). 

Only one model exists, i.e. the one in which the periodic adatom-substrate 
interaction potential is represented by a succession of parabolic arcs [56,57], 
rather than a truncated Fourier series, that has been solved exactly for all 
thicknesses ( h ) and strengths of atomic interaction. In this model the effect of 
unrealistic potential maxima (high cusped peaks) on the prediction of critical 
thickness is drastically reduced because at the critical thickness the MD’s are 
widely spaced and the contribution of the peaks to the overall energy is 
presumably small. The critical misfit for given h and materials with identical 
atomic interactions is now given by 


= \-v f d* 

4tt(1 + v) J X[X + a<f>(X, a)} ' 

o 

1 *(s 2 + cs-*-X-) + 5(c + :?) 2 /2(l-2t;) 

^ ,0) 2 *(s 2 -* 2 ) + 8(* 2 + sc+A'-A' 2 )/2(l-2»') ’ 

8 = 2(\ - v)hla , s = sinh X , c = cosh X . 


(24a) 


(24b) 
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For different elastic constants and thicknesses the critical misfit f c has the same 
form as (24a), but now the function (f> is vastly more complicated [70], Alterna- 
tively, for given misfit (/ c = /) this is an implicit equation for the critical thickness 

v 


2.1.3. Island growth 

The foregoing case of uniform epilavers covering a substrate is of great practical 
importance. In the initial stages of epitaxial growth, however, most systems 
exhibit island growth, which may be Volmer-Weber or Stranski-Krastanov 
growth [47]. If the misfit is not small or if the thickness at which the growing 
deposit becomes nearly continuous is large then loss of coherency will tend to 
occur when the deposit is in the form of individual islands. The coherency strain 
energy for the case of island deposits was first calculated by Cabrera [71]. This 
model included elastic strains in both the island and the thick substrate. It was 
used to calculate the critical radius above which it is energetically favorable to 
introduce MD’s [59]. The calculation of critical misfit and thickness in this case 
differs somewhat from that for layer^rowth. The authors have assumed that at 
the interface between a non-coherent hemispherical epitaxial island A of radius R 
and a thick substrate B MS’s of magnitude e a and e b exist with equilibrium values 
e™ and e™ . The condition for the critical misfit was taken as 

(25a) 

i.e. that almost all the misfit is accommodated by MS. For the case in which the 
atomic interactions are the same ( fx a = fi b = /x) and the interfaces have quadratic 
symmetry (a x = a = a) it follows, using eq. (10a), that for given R (approxi- 
mately) 


3fr(4a 0 + aj 
*(*o + *s) 


B \n[2(3(l + /3 2 ) 1 ' 2 — 2/3 : ] , 


(25b) 


P = 


(1 -p)R 


(25c) 


where a 0 and a s are the nearest-neighbor distances in overlayer and substrate 
respectively. This is an implicit equation in f c . For given misfit /, eq. (25b) defines 
a critical size R c . 

Because of the finite size of an island the introduction of discrete MD’s causes 
an abrupt decrease in MS which was observed experimentally in tin deposits on 
tin telluride by Vincent [72], For this case of discrete MD generation the 
calculation of critical thickness and subsequent MD introduction with island 
growth laterally has been considered from the point of view of energy minimiza- 
tion using Volterra MD’s [6], and using the Frank-van der Merwe model [73]. 
Vincent used the criterion that MD’s would be generated at an island size for 
which the MD would reduce the MS to zero. The main significance of the above 
calculations for the present paper is that the island morphology is the ideal one 
for availability of MD’s because an efficient MD can be generated at the edge of 
the island. This mechanism for the generation of efficient MD’s has been 
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confirmed experimentally by Yagi et al. [74]. Therefore one expects the barriers 
to MD generation to play only a small role in epitaxial systems exhibiting island 
formation, particularly at elevated temperatures where dislocation mobility is 
high. In such cases good agreement between experimentally observed values of 
critical size and calculated values is found. 


2.1.4. Superlattices 

In view of the increasing importance of superlattices over the past decade it 
became similarly important to develop an energetic approach to calculating the 
critical thickness for this case. A first approximation was made by applying the 
van der Merwe expression for the energy of an array of MD’s to the stack of 
interfaces [67]. This approximation did not, however, include the reduction of 
MD energy caused by the cancellation of strain fields across the layer between 
interfaces. This cancellation is strongest when the layer thickness is small. 
Further, the expression loses accuracy when the misfit is small. Both of these 
effects, which are relevant to the case of superlattices of layers A and B , have 
been recently accounted for more properly by reworking the interfacial misfit 
boundary from first principles including the boundary conditions applicable to 
superlattices [57]. 

In this approach [57] the following boundary conditions were adopted for the 
interior of an extended superlattice: (i) the normal and tangential forces at the 
interface are almost everywhere linear functions of the corresponding relative 
displacements of atoms on either side of the interface [56]. (ii) the stresses acting 
through the interface are continuous there, (iii) the midplanes between interfaces 
remain plane and contain no shear parallel to the interface, (iv) the superlattice 
faces normal to the interface are free of forces and (v) the crystals may otherwise 
be approximated by isotropic elastic continua. The governing equations of this 
model are exactly solvable. The critical misfit / c , as determined by eq. (14), 
depends on the ratios of elastic constants (Af) and thicknesses (r). i.e. on 


ju= ( ] + ou ~ v Ato 

(1 + v 6 )(l - v a )fl b ’ 



hg=7]C 


(26a) 


and on the thickness of any one layer (taken as 2 h B ), through an equation of the 
form 




(1 -rJO + tf-'M-V 


oh 


4ir(l + v)n h 


* 

/ 


dX 


X[X + 8(r,)<f>(X)] 


(26b) 


where 2 h a is the thickness of layer A, c is a reference lattice spacing, 8(t}) — 
2(1 - v)r) and <}>{X) is a complicated function of the variable X , elastic constants 
and thicknesses. In accordance with the criterion (14) this equation defines for 
given misfit f c = /, and ratios H and M, a critical thickness 17 = t} c . 

Various simple and asymptotic cases of this prediction have been considered of 
which the simplest is the one in which the layer thicknesses and elastic constants 
are respectively equal. In this case 
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c" 

7c + ~x 


cosh 2 X 

sinh X cosh X + X ’ 



(26c) 


i.e. it is no longer dependant on thickness and elastic constants. Also, approxi- 
mately 


/c(t,) = 2^T7) [ Jb arctan 2B + \ ln 



(27a) 


B(v) = [(9 + 4t ? )/6t7] 1 ' 2 , 8 = 2(1 — v)r) . (27b) 


Two conclusions from these results warrant special mention. Firstly it follows 
from a log-log plot that ri c varies approximately as /"*; x ~ 1*22 as compared to 
X * 1 for previous estimates and * = 2 for eq. (20) and the approach of People 
and Bean [75], which apparently agreed with observations of Bean et al. [76]. 
Also this calculation shows that when the layer thickness is small the energy of the 
MD's decreases by a factor of two over that of a single layer on a thick substrate. 
For thick layers, these energies approach one another. Further, since the misfit in 
a superlattice is accommodated equally by homogeneous elastic strains in each of 
the layers when elastic constants are equal, the critical misfit and the critical 
thickness both increase by a factor of up to five over their respective values for 
the case of a single layer on a thick substrate. 


2.1.5. Graded epilayer 

A critical thickness calculation for a bicrystal of a different kind was performed by 
BaH and Laird [77]. The authors considered coherency loss by the introduction of 
6(r^imperfect MD’s in a graded epilayer in III— V compounds, expressing the 
variation of lattice parameters by Vegard's law for a uniform composition 
gradient. They used the Volterra approach in expressing the stresses and calcu- 
lated the work W needed for MD*s to glide in from the free surface; a mechanism 
which also involves the overcoming of surface image barriers. Defining h c as the 
value of h which satisfies the relations (13b) and (14) they obtained (approxi- 
mately) 


h c 



1 + Pn, 

2p m 


(28) 


where d is the average spacing of atomic planes aligned parallel to the interface, 
p m is the value of glh at which W rises to a maximum as t increases and g is the 
distance of the MD from the free surface. The dependence of h c on grading s is 
calculated using the parametric equation (28) h c = h c (p m ) and 5 = j(p m ) (the 
latter from the equilibrium equation), rather than calculating p m directly from an 
extremum condition. In comparing these predictions with experiment the authors 
concluded that even in epilayers without threading dislocations, dislocation 
multiplication, which will inevitably take place, may introduce serious dis- 
crepancies between theory and experiment. 
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2.2. Stress and force criteria 


We now consider in some detail the application of force or stress balance in 
calculating equilibrium critical parameters. We have discussed, in relation to eqs. 
(13) the conditions under which force balance and energy minimization (equilib- 
rium) procedures are equivalent. Neither explicitly involves the nucleation energy 
barriers associated with the acquisition of MD’s. The minimization procedure may 
ignore the barriers because only initial and final energies are relevant. The force 
balance procedure, on the other hand, presupposes the existence of threading 
dislocations (TD’s), so that nucleation is in any case irrelevant. The mechanism of 
gliding a threading dislocation that threads the substrate, interface and epilayer 
on a single slip plane into the interface to make an increased line length of MD 
was first proposed and observed by Matthews [10]. An inevitable consequence of 
this mechanism is that the resulting MD is of imperfect type (fig. 1); as for 
example, when the generating dislocation is a 60^ glide dislocation in the 
semiconductor. The Peach-Koehler driving force F f on the dislocation, due to 
MS, is proportional to the thickness of the epilayer. The force F D opposing this 
glide force is the line tension. Matthews has used the Volterra approach (analog- 
ous to eq. (11a)) to calculate the line tension. When a critical thickness is reached 
for which the resultant vanishes so that the condition (13c) is satisfied, then any 
further increase in thickness will sweep the threading dislocation along the glide 
plane while it draws out a MD along the interface [13]. 


2.2.1. Thin epilayer on a thick substrate 

This model has been improved by adding the line tension a 0 (force F a ) of the step 
generated at the free surface when sweeping the threading dislocation [6, 14]. 
Furthermore, if the threading dislocation splits into partials separated by a 
stacking fault (energy y () per unit area) in the epilayer then the above model can 
be extended to include the relevant force F y needed for stacking fault formation. 
The condition in (13c) may accordingly be used to derive an expression for the 
critical thickness above which imperfect MD’s will be introduced [6, 13, 14,35]. 
The force balance equation (13c) now takes the form 

0=F i + F D + F a + F y 

= Bbhe cos A + Db(\ - v cos 2 a)[ln(/i/A) + 1] + a 0 b sin a + y 0 /i/cos <f> 

(29a) 


and yields for the critical thickness h c (e = /) the implicit equation 

_ £>£(1 - v cos 2 a)[\r\(h c /b) + 1] + a 0 b sin a 
c Bbf cos A - y 0 /cos <f> 


(29b) 


where A is the angle between the slip direction and the interfacial direction 
normal to the line of intersection of the slip plane (S) and the interface, a is the 
angle betweer^line of the mixed MD and its Burgers vector b and <£ is the angle 
between the unit vector n, normal to 5, and the line of intersection of the free 
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surface and a plane which is normal to the free surface and contains n. Note (i) 
that in the term In (h/b) h is the cut off radius assuming that h is less than half of 
the MD spacing and (ii) that for a perfect MD the y (1 term is absent and b is the 
complete Burgers vector. This example illustrates the power of the force balance 
approach. A disadvantage of the approach is that it handles less well the 
interaction between MD’s. 


3.2.2. Superlattices 

Hirth and Evans developed a criterion for critical thickness by considering 
injection of a dislocation dipole or a crack from the edge of a stack of coherent 
layers [78], into a superlattice in which consecutive layers have equal thicknesses 
and elastic constants. They calculated the Peach-Koehler force on the mobile 
segment of a dislocation dipole with one leg in one interface and the other leg in 
an adjacent interface connected by a segment threading a layer on its slip plane. 
This force was equated to the line tension of the mobile segment as was done in 
previous models [15]. The thickness for which this force balance occurs is the 
critical thickness h c that was calculated for isotropic elasticity as well as aniso- 
tropic elasticity. The results of this approach essentially agree with those of the 
earlier similar model [15], and for isotropic elasticity give 


b(\ — v cos 2 a) / V6/t c \ 

— ln l 2^ ;■ 


(30) 


This result is very much the same as the ones already dealt with except that the 
V6 corresponds specifically to the GaAs/GaAs () 5 P 0 5 system considered and r 0 is 
a core cut-off parameter. 


2.2.3. Multilayers 

Bokii and Kuznetsov have also calculated the critical layer thickness for mul- 
tilayered double heterostructures of Ga jr In,_ v P,_ v As v alloys from the point of 
view of balancing the MD line tension and surface step resistance to dislocation 
motion with the Peach-Koehler force on the MD due to the MS [79]. They 
started with the expression for critical thickness by Matthews [80] and allowed for 
elastic bending (a non-vanishing couple) by employing the elasticity calculation of 
Olsen and Ettenberg for the elastic bending of multilayer heterostructures [81] to 
develop a modified expression for the critical thickness of the topmost layer in the 
stack. They obtained for the critical thickness h JC in the topmost layer of a 
multilayer of multiplicity j the expression 

b( 1 - y cos 2 a)[ln(h./b) + 1] + 0,4?ra y (l - y 2 )sin a 
/c 877(1 + v) cos 6 sin 4/(e j + Nb) ’ 

where b , v , a, h } have evident meanings (see also eqs. (29)), is the lattice 
parameter and e j the elastic deformation into the topmost layer, N the number of 
MD’s generated, ij/ the angle between the (111) slip planes in the alloy and the 
free (001) surface and 6 the angle between the MD Burgers vector b and the [110] 
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direction. Note that the line energy of the surface step has been expressed in 
terms of shear modulus and Poisson's ratio; the last term in the numerator. Their 
experimental results are consistent with the introduction of MD’s by the Mat- 
thews mechanism of activating glide of the threading dislocations on their slip 
planes and also with this mechanism occurring in the topmost layer being grown 
at any given time. Comparison of their calculated values of critical thickness with 
their measured values shows reasonable agreement. 


2.2.4. MD formation by climb 

The above arguments rely on the glide of dislocations into the interface. It has 
also been observed that MD’s are introduced into the interface by climb [74, 82]. 
Even at relatively low temperatures climb can be the dominant mechanism for 
relieving MS through the introduction of MD’s. Matthews equated the climb force 
on a dislocation due to MS to those forces that oppose climb, and calculated the 
critical thickness for the generation of MD’s by climb [6, 14]. He included the 
possibility of faulted dislocation loops as well as the extra energy associated with 
creating or annihilating new surface during climb. Since MD’s generated by climb 
would naturally be in edge orientation the climb force is simply Bbhe cos </r 0 + 
Db\n(h/b), where ip 0 is the angle between the glide plane and the free surface. 
Since is naturally small we may approximate the width of the surface strip 
created or annihilated by b and the relevant force due to line tension by ±o 0 b , 
where cr 0 is the surface free energy. It follows as for glide that 

= D[ln{h c /b)+ 1] ± t r„ 

c Bf - y Q /b cos <li„ ( ’ 

Whether or not this mechanism operates will be controlled by the availability of 
point defects provided thermally or otherwise. 

The climb of MD’s out of the interface during interdiffusion of two crystal 
halves of a bicrystal has also been considered [83-85]. These considerations of the 
climb of MD’s have in common that the MD’s leave the interface by climbing in 
both directions i.e. toward both surfaces. This can lead to threading dislocations 
or to a distribution of MD’s throughout a zone of increasing thickness with 
increasing interdiffusion or to polygonization walls. The additional influence of an 
excess of point defects generated by the Kirkendall effect has been discussed 
[6,86]. The Kirkendall effect can either aid or hinder the climb process of the 
MD’s, depending on the relative diffusivities of the atoms. 


2.3. Geometrical criteria 

Geometrical criteria are not true equilibrium [24] criteria. Most of the geometrical 
models for accommodating misfit are of the type introduced earlier [3, 5] which in 
effect relate a closure failure to a dislocation density. This model is usually 
applied to planar interfaces that are narrow in thickness. There exist rather thick 
interfaces over which MD’s accommodate the misfit in a thick transition region. 
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referred to as a continuously graded junction. The an nea le d density /? A of misfit 
dislocations in a cross-section of the graded interface can be related to the linear 
density n L of misfit dislocations in an abrupt interface of the same misfit as that of 
the graded interface when a transitional interface thickness h> () is obtained [87]. 
When the graded interface is sufficiently thin that = 1 ln L then n A = n 2 L . For 
interface thicknesses below the transitional value w {} the misfit dislocation spacing 
is fixed at the value 1 ln L . For interface thicknesses above the anneSlo fl density 
of misfit dislocations is proportional to the lattice constant gradient. These 
considerations are useful for relating dislocation densities in the substrate, abrupt 
interface and graded interface in a unified way. 

The physical basis for these models beyond geometrical considerations is the 
relief of long-range stresses. In effect the geometrical models do not minimize 
long-range stresses by considerations that balance energies or forces but instead 
require that the long-range stresses be zero. Therefore they should be reasonable 
models when the misfit is not small, i.e. in a regime where energy and force 
balance considerations show that the residual long-range stresses should be near 
zero once the critical size is exceeded. In fact it has been shown that the mode of 
MA becomes vernierlike as the misfit increases, with vanishing MS (i.e. longer- 
range stresses), as for rigid crystals. Furthermore, deep minima in the AB 
interaction occurs for one-dimensional matching and maximum depths for com- 
plete registry. This provides energetic justification for the geometrical models [8]. 
The half-widths of the minima are zero for infinite interfaces and increase with 
decreasing interface area. 

The usual task of the geometrical model is to find the required MD density that 
will accommodate the misfit rather than to predict a critical size. It is also useful 
for selecting the various types of MD's that accommodate the misfit [3,29,88], 
There is, however, one geometrical criterion for critical size that is both simple 
and useful for estimating values quickly. Brooks [89] used the classical concepts of 
dislocations to arrive at essentially a geometrical criterion for critical size of a 
misfitting sphere by simply equating the misfit / multiplied by the sphere diameter 
2 R to the Burgers vector b of the MD. Physically this criterion is equivalent to 
requiring the introduction of a MD into the interface when the accumulated 
displacement from the misfit vernier causes an atomic shift to the next available 
potential energy trough, as was noted with regard to eq. (25c). The Brooks 
criterion is a good rule of thumb that gives critical-size values that are not very 
different from those provided by most sophisticated criteria [64]. 

3. Calculations', non-equilibrium 

In the foregoing we have discussed existing calculations of critical misfit and 
thickness based on equilibrium criteria. Empirical data, particularly on semicon- 
ductors, exist that simply cannot be understood in terms of equilibrium principles 
and can only be interpreted on the basis of non-equilibrium considerations and on 
the supposition that the path towards equilibrium goes through one or more, or 
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perhaps a distribution of, metastable configurations separated from each other 
and from the final stable configuration by energy barriers of the kind discussed in 
relation to eq. (16). For example, when the bicrystals are initially free of 
dislocations, MD's need be nucleated, possibly in the form of critical sized loops 
that would grow spontaneously in the presence of MS stresses. Misfit-accom- 
modating dislocation loops have indeed been observed in metals [34, 35] as well as 
semiconducting epicrystals [90,91], The processes of approach towards equilib- 
rium states clearly involve thermal fluctuations, and knowledge of the height of 
the energy barriers, e.g. the nucleation energy U of a MD loop that would 
continue to grow in the presence of the MS induced stresses, and of the rate or 
degree of advance towards equilibrium, are of vital interest. The latter, in 
particular, would assist in distinguishing the equilibrium or non-equilibrium 
nature of experimental data. 


3.1. Monolayer approach 

Both these questions have been addressed previously. Continuum theory predicts 
that U — 200 eV [52,78], which completely rules out the thermal acquisition of 
MD’s in an otherwise perfect crystal. Clearly U will be reduced by the MS 
(Peach-Koehler) stresses in a coherent epilayer. In fact, Frank and van der 
Merwe [46] have shown, using the one-dimensional model that the activation 
energy for the introduction of a MD in a coherent ML is 

= (4W ( / 0 /7t)[(1 -/V„)‘ •’ - f( a arccos fQ (33) 

per atom row of the ML, where the quantities in this relation have been defined 
in eqs. (19). The energy U , has a maximum value of 4V^(jTT~9W 0 di\ f- 0, 
diminishes to about 2M^at the misfit/^ ] = 2Itt(' 0 in eq. (19) where a coherent ML 
becomes unstable and vanishes at 

/[ ,) = l/( 0 = 7 r/< 1, /2 (=14%). (34a) 

In metals W 0 —0.2-0. 5 eV. Thus above / = f $ MD's enter spontaneously without 
the assistance of thermal energy. In accordance with eq. (14) this defines 
(crudely) a critical thickness 

h s = 7r 2 h c /4~~ 2h c (34b) 

above which MD’s will enter spontaneously for given / < /* n . 


3.2. Nucleation of misfit dislocation loops 

The spontaneous generation of MD s in practical cases was also considered by 
Dodson [52]. He realized that the smallness of the critical MD loop radius makes 
a continuum approach invalid and that therefore an atomistic approach is needed. 
He carried out calculations on Si considering a model system consisting of a 
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tetrahedral slab with (111) faces and periodic boundary conditions in the plane of 
the interface using a super unit cell of 7 x 7 atoms extending to a depth of 6 
monolayers. Dodson [ 92 ] employed modified Tersoff potentials using Monte 
Carlo techniques. He concluded that a climb MD loop of 9#^partial having a 
radius of 2 A becomes critical (forms spontaneously) at a compressive misfit of 
11 . 29 ?-. This is greater than the 4% estimated by equilibrium theories and very 
much greater than those which had been observed. Dodson suggests that perhaps 
the most significant limitation of this calculation is that it neglects the influence of 
existing surface steps. A feeling for the magnitude of this critical misfit may also 
be acquired if one compares this value of 11.2% with the critical misfit = 1 1{ {) 
(in eq. (3.4a)) of about 14% for spontaneous generation of edge-type glide MD’s 
in the one-dimensional ML model. In any case, these calculations clearly suggest 
that considerations on the nucleation of MD’s in an otherwise perfect crystal will 
not generate satisfactory understanding of observed non-equilibrium phenomena. 




i 


o 


3.3. Overcoming the Peierls barrier 

More recently Tsao et al. [93] have extended the foregoing work on non- 
equilibrium systems and measured the strain relief in metastable Si,Ge,_ x 
strained layers grown on (001) Ge substrates and concluded from their data and 
the comparison of the Peach-Koehler MS force component par parallel to the 
interface and the corresponding opposing force F D par due to the line tension (self 
energy) of the dislocation (respectively F- and F D of our notation in eq. (13c)) 
“that metastable strained layer breakdown is most directly correlated, not with 
thickness h and lattice mismatch /, but rather with (1) an excess stress r cxc = 
2(/v.p ar - F D p3T )/hb and (2) temperature”. 

Using the expression for the Peach-Koehler force and the Volterra form of line 
tension the authors obtained 

_ 2e/x(l + v) fi(\ - I'cos 2 a) ln(4hlb) _ r 

T «c - 277(1-,) hib~ ; 6=f (35) 

in the foregoing notation, where the angle a between the MD line and its Burgers 
vector is bCr'in the present case. 

Whereas the condition r cxc = 0 is equivalent to F = 0 in eq. (13c) and is still 
valid for equilibrium, the energy barriers temporarily delay the transition to 
equilibrium. At a temperature of 494°C strain relief became observable when 
WM- 0.024. 

Scaling excess stress with the shear modulus /a and temperature in Kelvin with 
the melting temperature T m allowed the authors to construct experimentally 

confirmed stress /temperature stability diagrams which provide a simple unified 

description of the stability-metastability regimes for Si x Gej_ t strained layers. 

Since the derivation of t cxc assumes the presence of threading dislocations one 
may speculate that the relevant barriers are those due to the Peierls stress 
opposing the glide motion. The net driving force (F- - F D ) which consists of the 


o 
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Peach-Koehler MS force diminished by the line tension is proportional to r cxc . It 
reduces the height of the Pcierls barrier [61.94]. When the barrier height is 
sufficiently reduced by increased layer thickness the frequency with which the 
barrier is overcome increases to the extent that strain relief becomes observable. 
Thus the finding of the authors can be understood in terms of fluctuation 
principles. 


3.4. Overcoming the surface barrier 

Sharan and co-workers [95] also recently dealt with the problem of the critical 
thickness-misfit dependence of GeSi alloy epilayers on (001) Si surfaces. They 
considered the inward glide on a (111) slip plane of a long straight 6CPdisIocation 
that had been nucleated “at” the surface, and calculated, using “surface disloca- 
tion” techniques to satisfy boundary conditions, the (surface) energy barrier t/ s 
per unit area of interface (comprising self energy as well as work done by the MS 
stress) which it has to overcome in its passage from the free surface to the 
interface. The barrier U s has furthermore been calculated on the assumption that 
there is a sequence of MD's of sufficient density to accommodate all the misfit (as 
expressed in terms of MD spacing), while not affecting the MS stress. The authors 
adopted the cr^rion that strain relief commences (the critical misfit is reached) 
when E- 5= £/., where £- is the misfit strain energy per unit area of interface when 
/is entirely accommodated by MS. The fact that this approach yields consistently 
higher values of critical misfit than the approaches of Matthews [6, 14] and others 
reflects that the present approach takes into account the opposing image force on 
dislocations when they recede from the free surface and the fact that the criterion 
£ s 2* U s is a non-equilibrium extension of the approximate equilibrium criterion 
(15) which overestimates the critical parameters. Although the theory is not a 
true nucleation theory it involves the concept of an energy barrier characteristic of 
a transition from a metastable to a stable configuration. The approach also does 
not incorporate the Peierls barriers [14,93] thought to be important in the 
semiconducting materials concerned. 


3.5. Quasi-nucleation approach 

Another critical misfit calculation that is significant and had also been controver- 
sial is the one of People and Bean [75], who developed an analytical expression 
for h c to account for the large discrepancy between observed values and those 
calculated from the expressions of Matthews [6, 14] and van der Merwe [56]. The 
authors essentially assumed that the energy £ D of a MD, which they simply took 
to be of screw type [29], is very much localized within a vertical wall spanning an 
interfacial area L x w (L = MD line length and w = MD “width” taken as five 
[110] spacings for Ge x Si_, alloys). They furthermore assumed that h c is reached 
when the portion of £^ contained in the volume of the wall exceeds £ D , This 
approach, which the authors presented as a nucleation approach but clearly is not 
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a true one, also contains elements of the criterion in eq. (15). The approach was 
subsequently also employed by Luryi and Suhir for finite substrates [23]. In 
equilibrium approaches one finds that the critical thickness depends on the misfit 
to a power between - 1 and - 1.5 [57]. The effect of the fixed width of the (screw) 
dislocation is to yield a misfit dependence of the critical thickness near a power of 
-2 as in the crude approach to the result in eq. (20), and apparently yielded 
agreement with their data. In effect this approach is trading all of the MS energy 
for an almost fixed energy barrier whose value was empirically adjusted through 
the dislocation width to fit the experimental data on silicon-germanium alloys 
[76]. To understand the physical details of this approach is difficult but its success 
seems to lie in picking a nearly fixed energy barrier so that the explicit square 
dependence of MS energy on misfit is transmitted to the expression for h c . It is a 
quasi-nucleation approach in which energies in finite volumes are compared 
rather than total energies as for equilibrium. Furthermore, it has recently been 
argued that the authors made serious flaws in the interpretation of the results [96]. 
Because of its controversially this paper had the merit of stirring up great 
activity. 


3.6. Misfit strain gradient (MSG) 

An interesting case of critical thickness concerns epilayers on substrates of finite 
width. It makes use of a significant misfit strain gradient (MSG) to accommodate 
the misfit /. Luryi and Suhir adopted a rigid substrate width of 2£ along the 
one-dimensional misfit direction and an epilayer of the same width to calculate 
the MS energy £ f per unit area of the interface for the case of a coherent 
interface with a strain field decaying exponentially with distance normal to the 
interface [23]. They showed that when the width ( is small in comparison to the 
thickness h , there is an effective thickness h c smaller than h that characterizes the 
limit of the strain field. In other words E f remains finite even though h becomes 
infinite. The authors defined an effective finite epilayer thickness h e by equating 
the MSG energy £,- per unit interfacial area, calculated on the basis of the stress 
distribution on the mid plane, to [£/ 2 /(l - v))h t , i.e. as though there exists a 
Uniform energy “density" up to a distance h e from the interface, obtaining 

*• - *(• - “ ch - ‘"'"i - *[*(£)]’ • < 36a > 

where the function & is such that 

f A when f 8> A , (36b) 

c ~ { f V I it when^<tA, (36c) 

and =3(1 - v)H{\ + v), v being Poisson’s ratio. The quantity defined for 
1, is obtained numerically from (36a) and (36c) and is about 0.89 for 
Ge^i,., alloys, so that A e /^ = 1/4 in this case whereas A e *=A when 

Oh. 

The authors furthermore calculated a critical thickness h f c following People and 
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Bean [75] by assuming that coherency breaks down with the introduction of a MD 
when E; exceeds E n ; E- and E D having the same meaning as in ref. \£f]. This 
yields for given / and ( the equation 



t=h[ 


A = 


20\/2ir<l(l + v) 
(1 ~»)b 


(37a) 


for h f c , where b (=4 A for Ge r Si,_ v ) is the Burgers vector of the MD and a is the 
lattice constant. The solution of (37a) is seen to be a function of filt(£lh') and 
may accordingly be written in the form 

hUf) = h c {Mflh ( c )} (37b) 

with the limiting value h c {f } when (ih[ §> 1. The factor ln(h*/fc) in (37a) results 
from the concept of a cut-off radius (see eqs. (11)) R = h. applicable when £ > h. 
When £ < h the authors adopted as cut-off radius R = £ (= distance to lateral free 
surface) and obtained for £ c the equation in (37a) with t = £ c . In the limit £ < h 
they accordingly obtained (using (36c)) for (37b) the result 

e min (f) = h c {fi/V7r}. (38) 

The authors showed that the curve of /?'(/) vs £ has two asymptotes h c and £ mtn 
and that, at (ib - ( m Jh. h[ i.e. when the critical thickness is 

infinite. Equivalently this value of width. € min , represents the size limit below 
which the misfit is entirely accommodated by a MSG. It then is better described 
as a situation for which there ^xists a critical misfit / c that is a function of 
( These calculations are qualitatively very significant because they 

point to a new mechanism for avoiding MD's. They could be refined by allowing 
relaxation of the substrate and by adopting a more physically based energy 
balance analysis. For example the assumption of a uniform strain /at the interface 
when £ is small may not be realistic [73]. 


3.7. Strain relief by crack formation 

«. 

The acquisition of MD’s by the nucleation of critical sized MD loops at typical 
temperatures has been ruled out. It is interesting though that a calculation of the 
stress intensity factor for a crack gives a critical thickness above which crack 
growth will occur as being near h c for the nucleation of a dislocation loop. The 
experimental value for critical thickness compared to that given by the above 
calculation gave a value near h c obtained from a balance of forces. This result 
points to the growth of pre-existing dislocations as the operative mechanism of 
coherency loss rather than the nucleation of new MD's. There are, however, 
several epitaxial systems for which insufficient dislocation slip is available to 
accommodate the misfit and cracking occurs as an alternative [27]. To model the 
critical thickness for cracking a Griffith criterion was adopted. The critical 
thickness for cracking is equal to the critical Griffith crack length: 

^crack = 2y(l - vfhrEf 2 


(39) 
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in which the surface free energy y of the crack has been approximated by 
y - EaJ 10, E being Young's modulus [59] and a 0 the interatomic distance. The 
third and somewhat extreme case, that the misfit stresses are large enough for 
cracks to form spontaneously, appears to be of less importance. Reasonable 
agreement between experiment and the relation (39) was found [27] for an 
epilayer with a cleavage /normal to the interface. 

Crack formation of a different kind was investigated by Hirth and Evans [78]; 
crack formation starting at the free lateral faces of a stack and propagating along 
the interface. Accepting as postulate for /? crack that crack propagation proceeds 
when the driving stress due to MS exceeds the resistance K inl to crack growth they 
obtained the value 


^crack 


. 2(1 + v)fl J 


(40) 


Using reasonable estimates for K' nx this yields values of the same order as h c for 
the introduction for MD’s. 




3.8. Dynamics of strain relief 

Recently Dodson and Tsao [97] used a phenomenological model of dislocation 
dynamics and plastic flow developed by Haasen and co-workers [98] for diamond 
structure materials to investigate the role of strain relaxation rale in experimental 
data concerning critical thickness in semiconductors. The main features of the 
model are: (i) the epilayer is initially strained to coherence (MS e - misfit/) in a 
metastable configuration, (ii) the MS is generated as a constant biaxial deforma- 
tion relaxed by the introduction of MD network at the interface, (iii) the rate 
process mechanism is the acquisition of MD's; elongation of existing dislocations 
and the generation of new ones, (iv) a localized stress r loc exists that acts as a 
driving force for the motion of dislocations, while the motion is retarded by 
energy barriers needing thermal energy for overcoming them and (v) r loc can be 
expressed in terms of the residual mismatch f-f(t) - e(h); f{t) being the misfit 
accommodated at time t by the insufficient density of MD's (the plastic part) and 
e{h) the equilibrium MS at thickness h , The authors consider a nearly dislocation- 
free coherent layer and make provision for MD sources in terms of a background 
dislocation density / 0 , obtaining the result 

df(t)/dt = Cfi 2 [f-f(t) - , (41) 

where fx is the shear modulus and C a constant depending on temperature. 

The authors concluded that this equation gives an excellent description of the 
data [76] for SiGe alloys on (100) Si substrates using as model parameters 
C = 30.1 and /= 10~ 4 The results are very insensitive to the unknown value for 
/,. For example, while the critical misfit for a 500 A epilayer is 0.004, both 
theoretical and experimental values only deviate observably from coherency at 
/ = 0.1. These considerations constitute significant progress towards a successful 
dynamic description of strain relief, i.e. non-equilibrium critical phenomena. 
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4. Correlation with experiment 

There are numerous techniques by which loss of coherency may be observed. 
Typically the experimental measurements of the critical thickness for coherency 
loss rely on observations of MD’s or such indications of their presence as moire 
fringes in electron micrographs. The roles of the MV, MSG and MC mechanisms 
are usually insignificant and ignored. Further, the measurement of critical thick- 
ness and misfit is not a precise one. Nevertheless a sufficient number of meas- 
urements and estimates have been made to permit the disclosure of some trends. 
We will not consider all of the measurements reported in the literature, but rather 
deal with them in several categories. Metal islands, metal films, semiconductor 
systems and superlattices will be covered, but precipitates will be omitted. For a 
collection of data on the critical size of precipitates one may consult refs.^ijjand 

M 

There are two measurements of the critical size for loss of coherency between 
island overgrowths and their continuous substrates. For cobalt islands on copper 
substrates and y-iron on copper the respective sizes of the largest islands that 
remained coherent at 38 nm and 75 nm [13]. The corresponding calculated values 
are 36 nm and 87 nm [59]. This closeness of agreement between measured and 
calculated values is not typical and therefore may be somewhat fortuitous. 

A number of measurements of h c have been made on thin films of nickel on 
copper substrates and Cu on Ni. Gradmann measured the critical thickness in a 
Ni/Cu vapor deposit as about 1.0 nm [9]. Similar results were obtained for 
electrodeposits of Ni on Cu [11, 99]. Vapor deposits of Cu on Ni showed an h c of 
0.8 nm [100]. The calculated values using both the energy balance and force 
balance viewpoints yield nearly equal values of near 1.2 nm. which are somewhat 
high [14,59]. McWhan reported on h c measurements of 4.0 nm for the case of a 
Cu/Ni superlattice [101]. This value for the Cu/Ni superlattice is about four times 
that for the single layer, a result in agreement with calculation [15,57]. We note 
here that the measured values are less than the calculated ones. The misfit in the 
Cu/Ni system is about 2.6%. 

Systems with larger and smaller misfits have also been studied. Thin films of 
gold on silver which have a misfit of 0.2% have been found to lose coherency 
when the overgrowth is around 28 nm [10]. This value agrees closely with the 
calculated value of 24 nm [59]. Two large-misfit systems have been studied. The 
misfit between films of y-iron and a (111) gold substrate [102] is about 14% and 
the measured value of h c is near 0.1 nm, which is too low by a factor of three. 
However, at these thicknesses the model is known to become somewhat inaccu- 
rate [59]. The Ag/Cu system with a misfit of abot 12% was studied, but instead of 
a measurement of /i c , the elastic strains were measured as a function of film 
thickness down to a value of about 0.4 nm [103]. The deposit was not continuous 
at this thickness but the islands were over 1 |xm wide and hence approximated a 
two-dimensional overgrowth very well. Coherent growth w'as not observed. The 
functional dependence fit well the model of Ball [104] except for very small 
thicknesses where the van der Merwe calculation fits better [56]. The model of 
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Ball is based on a parabolic interaction potential while that of van dcr Merwe is 
based on a sinusoidal interaction potential. The agreement between the measured 
and calculated elastic strains may be taken as agreement between the critical 
thickness approach and experiment. Another way of comparing data to theory is 
to look at the calculated value of critical misfit for a continuous deposit of 
monolayer thickness. The calculation by van der Merwe [54, 56] gives the 
equilibrium value of this critical misfit as about 7% for average atomic bonding. 
At this misfit the introduction of MD’s still requires an activation energy which 
only vanishes at about 10% misfit depending on relative bond strengths [54]. 
From this point of view the Fe/Au results that show coherency at 14% misfit 
suggest that the adatom-substrate bonding is strong in comparison with adatom- 
adatom bonding as is implied by eqs. (19). For Ag/Cu the agreement likewise 
suggests bonding of about equal strength. There is another observation related to 
this critical misfit value. The Pd/Cu and Cu/Pd systems were found to be 
coherent in the initial stages of growth [105]. The misfit for these deposits is about 
7%, a value near the critical misfit for a monolayer and hence in agreement with 
calculation. Only fair agreement with this calculated value of critical misfit is 
obtained from antimony deposits on tungsten [106], Here the misfit is just near 
8% and the critical thickness for loss of coherency was measured to be three 
monolayers for (100) and (211) oriented substrates. No coherent growth was 
observed for (110) oriented substrates. This is likewise consistent with weak 
bonding (smaller amplitude W () in eq. (20)) on the smoother (110) b.c.c. tungsten 
surface. 

The trends indicated by the above comparisons between measured and calcu- 
lated values of critical thickness and critical misfit for metal systems is that the 
agreement is as good as one could expect from the crude models employed. In 
view of the lack of precision in the data and the approximations involved in the 
models, the case of metal epitaxial growth is adequately described by equilibrium 
considerations. The measured and calculated values of equilibrium elastic strain 
agree less well than critical thickness and misfit comparisons between measured 
and calculated values [14]. But even there the disagreement is not too severe. 
Significant discrepancies do occur, however, when one makes similar comparisons 
to those above but for the case of non-metals [14, 107]. The significant differences 
between the two cases can be understood in terms of the availability and mobility 
of MD’s. The metal case has relatively higher values of both as compared to the 
non-metal case. 

The growth of GaAs on substrates of GaAs doped with indium showed a 
measured value of h c near 0.7 p m [108]. The misfit for this system was measured 
to be 3.2 x 10” 4 which yields a calculated value of h c from the force balance 
equation of Matthews [14] of 0.59 pm. The measured value exceeds the calculated 
value by an acceptable margin in this case. Other experiments that favorably 
compare with calculated values of h c may be found in some examples of 
superlattice growth. In a superlattice of PbTe/PbSnTe where the composition of 
the PbSnTe corresponds to a misfit of 0.39% a value of h c near 50 nm was 
measured. This compares favorably to the value of near 78 nm calculated from an 
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equilibrium minimum energy balance approach [57]. A superlattice of GaAs and 
InGaAs for which a misfit of 1 .9% exists showed a measured value of h c as 18 nm 
[109]. The calculated values for this case are 25 nm from the force balance 
approach [15] using the Voiterra model and 9 nm from the equilibrium approach 
[57], using a parabolic interaction potential. Other examples of reasonable 
agreement between measured and calculated h c values for superlattices could be 
quoted for non-metals [15. 16]. It is instructive, however, to look at those cases 
that do not fit the equilibrium calculations. 

Silicon epitaxially grown at 570°C on GaP substrates has a misfit of 4.6 x 10~ 3 
at 570°C and of 3.6 x 10 ' at room temperature and a measured value of h c of 
100 nm [110]. The calculated value using the force balance approach is 33 nm [14]. 
An order of magnitude discrepancy was found for the case of germanium 
deposited at 350°C on GaAs substrates having a misfit of 0.7% and a measured h c 
of 2 p.m [107]. The authors showed that smaller values of residual strain and 
smaller h c values, close to the calculated values of /i c , were observed when the 
specimens were annealed at 600°C for 30 min. Similar discrepancies of the 
magnitude found for Ge on GaAs were observed in magnetic garnet [27, 111]. In 
this case the nearly dislocation-free substrates and high nucleation barrier to MD 
generation led to relaxation of the MS through cracking of the epilayer on 
cleavage planes normal to the interface [27]. 

Two further examples of non-metal layers deviating substantially from the 
calculated critical thickness values are provided by InGaAs grown on InP 
substrates [112] and by SiGe alloys on Si substrates [75]. In the former case a 
number of thickness-misfit combinations were investigated for the presence or 
absence of MD's. A map of the data points was constructed from which several 
deductions can be made. The dependence of /? c on misfit is very steep for both 
positive and negative misfit values. The data show some scatter, but for a 
thickness of 4fim a critical misfit of 1.2 x 10~ 3 is observed and for a misfit of 
2 x 10~ 3 a critical thickness of less than 1 occurs. They showed that cross 
hatch which loosely correlates with MD’s can be imaged several micrometers into 
the InP substrate showing that MD’s penetrate to great depths in the substrate but 
are not present in the InGaAs deposit. The above values of misfit-thickness pairs 
are not in agreement with the calculated values "on the basis of equilibrium 
principles, the former being almost an order of magnitude greater than the 
calculated h c values. This discrepancy coupled with the very strong dependence of 
h c on misfit suggests that the equilibrium model is not applicable to the behavior 
of this system. A similar discrepancy of an order of magnitude between the 
observed and calculated h c values for SiGe alloys on silicon substrates also 
suggests a model based on other than equilibrium principles. 

It is clear from the model of People and Bean [75], and subsequent review by 
Bean [113] as well as from the success of the incorporation of a thermally 
activated process in the model [107], that one must include the nucleation and 
mobility of MD’s in an analysis of a misfit accommodation by MD’s in non-metal 
systems. Cabrera was the first to recognize the importance of the barriers to MD 
acquisition [71]. It is this problem that is receiving renewed attention. The only 
true nucleation theory approaches that incorporate thermal activation for generat- 
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ing MD's are those already mentioned [52,78, 107]. Two of these approaches, 
however, have not included the effect of the mobility of dislocations once they 
have been nucleated [52, 78]. A successful model must incorporate the three basic 
components, nucleation, growth and the ultimate equilibrium state being ap- 
proached. 


5. Discussion and conclusions 

The existence of a natural misfit / between the two crystals of an epicrystal pair 
AB has important fundamental and technological implications. The misfit can be 
accommodated in several modes of which misfit strain (MS; e) and misfit disloca- 
tions (MD’s; /) are the most prominent. MD’s and MS may coexist (/—/ + |e|) 
and knowledge of the criteria for the coherency-incoherency transition (C-IC: 
/ = 0-*/ ^ 0) is highly desirable. Because of continuity considerations it has been 
mathematically expedient to study the inverse transition (IC-C; /VO— ►/ = ()). 
For a given overlayer thickness h the transition occurs at a critical misfit / c , and 
conversely, for a given misfit, at a critical thickness h c . The objectives of this 
paper were to describe (i) the principles governing these quantities, (ii) the 
criteria adopted for determining their values and (iii) the models used in the 
calculations; and also to assess (iv) the successes of (i)-(iii) as judged by their 
correlation with experiment. Clearly the processes effecting the transitions are 
given by the free-energy gradients towards the minimum of free energy (A) 
defining the ultimate equilibrium. Because of its dominance in the free energy 
(A), A is usually approximated by the energy £; thus effecting significant 
simplification. There exists a hierarchy of energy barriers impeding progress and 
the system may thus linger in a metastable configuration for time spans depending 
on the height U of the barrier and the thermal energy (kTJ available. An 
observation may accordingly find the system in a stable or a metastable configura- 
tion. The degree of equilibrium affects two entirely different though basic aspects: 
(i) the growth mode; ML-by-ML s 2D and islands = 3D, and (ii) the status of the 
C-IC transition. 

Because of their simplicity most calculations in the past have been carried out 
for 2D growth at equilibrium. Materials that grow 3D at equilibrium can be 
coerced by non-equilibrium conditions into growing layerlike [48,49]. This pro- 
vides some credibility for the practice of calculating critical parameters for 
systems of uniform thickness. The critical misfit and thickness obtained from 
equilibrium criteria will be denoted by f c (h) and h c (f) respectively, or simply by 
f c and h c . Equilibrium theories assume the unimpeded acquisition of MD’s and 
can be analysed by either energy minimization (eqs. (13a) and (13b)) or equival- 
ently by balancing forces (eq. (13c)). The latter criterion usually assumes that 
crystal dislocations, specifically threading dislocations, are already present. Ener- 
gy minimization has often in the past been approximated by (eq. (15)) equality of 
MS energy (at \e \ =/) and MD energy (at / = /). This assumption may however, 
overestimate critical parameters by as much as 30%. 

The fundamental (thermally activated) processes affecting growth (adatom 
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diffusion) and C-IC transitions (MD nucleation and motion) are entirely differ- 
ent. The activation energy (U) for the latter are normally vastly in excess of those 
of the former, but decrease in proportion to the layer thickness as a consequence 
of the MS induced Peach-Koehler forces. U(h) ultimately vanishes at h s (f)> 
/i c (/), whereafter MD's are formed spontaneously. When h approaches /i s from 
below, U becomes so small that the frequency with which the barrier is surmoun- 
ted with the aid of thermal energy is large enough for MS relief to become 
observable. Clearly the rate of equilibration for growth shapes (involving adatom 
surface migration) and C-IC transitions (involving the introduction of MD's) are 
not coupled. 

Most models employed in the past are of phenomenological nature and have in 
common that they treat the crystals individually (A-A and B-B interactions) in 
the harmonic approximation as linear elastic isotropic continua. Mainly two 
models have been used for the A-B interactions: (i) the Volterra model (VM), 
exploited by Matthews, that assumes the relative displacement u of atomic planes 
on either side of the interface to be uniform and equal to b (the Burgers vector) 
and (ii) the Frenkel-Kontoro^a model (FKM) that adopts an interaction which 
varies periodically in u with periodicity b and amplitude W approximately 
proportional to the AB bond strengths in contrast to the VM where the variation 
is ignored and u = b throughout the interfacial region between consecutive MD's. 
Clearly the FKM is more realistic. Whereas the FKM is solvable with the more 
acceptable sinusoidal variation only for a monolayer (ML) and on a thick 
substrate and for two “thick" halfcrystals, and for all thicknesses only when the 
less acceptable parabolic representation is used, the VM is applicable to ail 
thicknesses and can handle imperfect MD's, partials and the new surface which is 
created in the MD acquisition process. The main shortcoming of the VM is its 
inaccuracy at very small thicknesses: its strain field representation is inadequate 
and it does not provide for a variable A-B bond strength which is not negligible 
at small thicknesses. 

More recently the phenomenological models have been supplemented by ab 
initio calculations using appropriate potentials. Whereas the continuum models 
have general qualitative and semiquantitative predictive power the ab initio 
calculations are limited to the material combinations considered. A prerequisite 
for ab initio calculations is to find accurate interaction potentials. It is foreseen 
that these calculations which have more quantitative significance will generate 
new insights particularly for systems with one or more dimensions of small atomic 
order where the applicability of the continuum models is questionable. 

Experimental observations on metals are in reasonable agreement with equilib- 
rium predictions of f c (h) and h c (f) and also yield acceptable evidence of the 
influence of bond strength, specifically the AB bond strength. The situation is 
somewhat different for strong covalently bonded semiconducting crystals. While 
some show agreement with equilibrium predictions others yield values of h c 
greatly in excess of /i c (/). It is known that the mobility of dislocations is seriously 
impeded by the relatively high Peierls barriers in some of them. e.g. Si and its 
alloys. The theoretical supposition that the barrier height may be reduced by MS 



induced Peach-Koehler forces to the point where MS relief becomes observable 
in the presence of the appropriate thermal energy ( kT s , T s ^substrate tempera- 
ture), seems to be confirmed. The nucleation probability of fresh MD loops is 
practically zero in an otherwise perfect crystal, and they must come from a 
“source" of some kind when they occur. 

It may be concluded (i) that the phenomenological continuum theories have 
played a major role in understanding the physical principles underlying the 
observed critical parameters and will continue to do so, (ii) that ab initio 
calculations will play an increasing role in obtaining more quantitative under- 
standing. and (iii) attempts at solving the dynamics of MS relief, for which there 
is a great need, will be more frequent. 
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